(54) Eukaryotic transposable element

(57) Disclosed are isolated transposable elements,
isolated DNA sequences which encode a transposase
protein (or a portion of a transposase protein), a purified
transposase protein, or peptide fragments thereof,
encoded by the DNA sequences. The isolated transposable
elements and the isolated DNA sequences are
characterized by the ability to hybridize to the DNA
sequence of Minos-1. Further disclosed are methods of
gene tagging, insertional mutagenesis and exon trapping.
Transgenic animals and transgenic plants are also
disclosed.
Description

BACKGROUND OF THE INVENTION 

[0001] The Tc1-like family of transposons and the retroviral-like transposons are unique for their wide dispersion in
diverse organisms. Members belonging to the Tc1-like family have been characterized in nematodes, diptera, fish and
amphibians: Tc1 in Caenorhabditis elegans, Tcb1 in Caenorhabditis briggsae, HB1 in Drosophila melanogaster, Uhu
in Drosophila heteroneura, Minos in Drosophila hydei, and Tes1 in the Pacific hagfish Eptatretus stouti. All are
characterized by a relatively short length (1.6 to 1.8 kb), the presence of inverted terminal repeats, and significant
sequence similarity in the region between the repeats.

[0002] The Minos-1 transposable element has been identified as a 1775 bp dispersed repetitive sequence inserted
within the transcribed spacer in one of the repeats of Drosophila hydei (Franz and Savakis, Nucl. Acids Res. 19: 6646
(December 11, 1991)). The element is characterized by 255-bp long perfect inverted repeats and the presence of two
long, non-overlapping open reading frames (ORFs) on the same strand. The longest of the ORFs shows approximately
30% sequence identity with TcA, but does not begin with an ATG codon. It appears, therefore, that the cloned element
represents a defective member of the Minos family, as is the case with all previously sequenced Tc1-like elements, with
the possible exceptions of Tc1 and Tcb1.
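The structural hallmark described above, perfect inverted repeats at the two termini, can be checked computationally: the last n bases of an element should equal the reverse complement of its first n bases. The Python sketch below is illustrative only. The function names and the toy sequence are assumptions made for this example and are not part of the disclosure; for Minos-1 the repeat length would be 255.

```python
# Illustrative sketch: test whether a DNA sequence carries perfect
# inverted terminal repeats of a given length. All names are hypothetical.

_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def has_perfect_inverted_repeats(element: str, repeat_len: int) -> bool:
    """True if the right terminus is the exact reverse complement of the left."""
    if repeat_len <= 0 or 2 * repeat_len > len(element):
        return False
    left = element[:repeat_len].upper()
    right = element[-repeat_len:].upper()
    return right == reverse_complement(left)


# Toy example (not Minos sequence): 6-base inverted repeats around a spacer.
toy = "ACGGTC" + "TTTTAAAACCCC" + "GACCGT"
print(has_perfect_inverted_repeats(toy, 6))
```

A single mismatch in either terminus makes the function return False, which matches the "perfect" qualifier used in the text; tolerating mismatches would require a different comparison.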

[0003] Transposable elements are natural components of genomes ranging from bacteria to vertebrate organisms
(Lewin, Genes VI, Chapter 18, Oxford University Press, (1997)). Thus, due to their widespread phylogenetic
distribution, evolutionary conservation and genomic mobility, transposons are valuable tools for genetic manipulations,
such as, for example, the integration of nucleic acids in germ cells for the production of transgenic animals, and genetic
transformation and insertional mutagenesis in somatic cells and viral vectors for use as therapeutics.

SUMMARY OF THE INVENTION 


[0004] The invention relates to an isolated transposable element, or an isolated DNA sequence which encodes a
transposase protein (or a portion of a transposase protein), to a purified transposase protein, or peptide fragments
thereof, encoded by such a DNA sequence, and to methods of using the transposable element and transposase
protein. The isolated transposable element and the isolated DNA sequence are characterized by the ability to hybridize
to the DNA sequence of Minos-1 under stringent hybridization conditions.

[0005] In another aspect, the invention relates to a method for the stable introduction of a nucleic acid sequence of
interest into a cell. This method involves the use of an isolated transposable element of the type described in the
preceding paragraph, the isolated transposable element being modified to include the nucleic acid sequence of interest
flanked by the termini of the isolated transposable element. This modified transposable element is introduced into the
cell in the presence of a transposase protein, or a nucleic acid sequence or a virus encoding a transposase protein. The
role of the transposase protein is to catalyze the transposition of the modified transposable element containing the
nucleic acid sequence of interest into the genome of the cell. Also envisioned are cells produced by this method.
[0006] In a third aspect, the invention relates to a method for isolating members of the Tc-1 family of transposable
elements from genomic DNA of a eukaryote of interest. According to this method, oligonucleotide primers are provided
which are complementary to a sequence of at least about 12 consecutive nucleotides which encode amino acids which
are highly conserved in aligned sequences of nematode Tc-1 family members and Minos family members. These
oligonucleotide primers are used to prime amplification by the polymerase chain reaction (PCR). The amplification
products are then used to isolate DNA encoding the entire Tc-1 family member from the eukaryote of interest by
conventional methods.
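Because the primers described above are designed against conserved amino acids rather than a fixed nucleotide sequence, the number of distinct nucleotide sequences that could encode a chosen peptide (its degeneracy) is a practical consideration. The following Python sketch counts that number under the standard genetic code. It is a hedged illustration, not the procedure of the disclosure: the function names are hypothetical, and using the short peptides from the sequence listing (SEQ ID NOS: 10 and 11) as inputs is an assumption made only for the example.

```python
# Illustrative sketch: degeneracy of a nucleotide sequence encoding a
# conserved peptide, using the standard genetic code. Names are hypothetical.
from itertools import product
from math import prod

BASES = "TCAG"
# Standard genetic code, codons enumerated in the order TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def codons_for(amino_acid: str) -> list[str]:
    """All codons that encode the given one-letter amino acid."""
    return sorted(c for c, aa in CODON_TABLE.items() if aa == amino_acid)


def degeneracy(peptide: str) -> int:
    """Number of distinct nucleotide sequences that encode the peptide."""
    return prod(len(codons_for(aa)) for aa in peptide)


def min_residues_for(n_nucleotides: int) -> int:
    """Smallest number of amino acids spanning at least n nucleotides."""
    return -(-n_nucleotides // 3)  # ceiling division


# Assumed inputs, taken from the sequence listing for illustration only.
for peptide in ("MVWGC", "WPSQSPDL"):
    print(peptide, 3 * len(peptide), "nt, degeneracy", degeneracy(peptide))
```

Peptides rich in single-codon residues such as methionine and tryptophan keep the primer pool small, which is one reason such residues are often favored when choosing a conserved stretch of at least 12 nucleotides (four codons).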

[0007] In a fourth aspect, the invention relates to a transgenic animal, which is produced by a method which involves
the use of an isolated transposable element characterized by the ability to hybridize to the DNA sequence of Minos-1,
the isolated transposable element being modified to include the nucleic acid sequence of interest flanked by the termini
of the isolated transposable element. This modified transposable element is introduced into a cell in the presence of a
transposase protein, or a DNA sequence or a virus encoding a transposase protein.

[0008] In a fifth aspect, the invention relates to methods of integrating a nucleic acid sequence of interest into a
chromosome of a cell. This method involves the use of an isolated transposable element of the type described in the
preceding paragraph, the isolated transposable element being modified to include the nucleic acid sequence of interest
flanked by the termini of the isolated transposable element. This modified transposable element is introduced into the
cell in the presence of a transposase protein, or a nucleic acid sequence or a virus encoding a transposase protein.
Also envisioned are cells produced by this method.

[0009] In a sixth aspect, the invention relates to a transgenic plant, which is produced by a method which involves the
use of an isolated transposable element characterized by the ability to hybridize to the DNA sequence of Minos-1, the
isolated transposable element being modified to include the nucleic acid sequence of interest flanked by the termini of
the isolated transposable element. This modified transposable element is introduced into a plant cell in the presence of
a transposase protein, or a nucleic acid sequence or a virus encoding a transposase protein.
[0010] In a seventh aspect, the invention relates to insertional mutagenesis and gene tagging. In this approach, the
Minos transposable elements are inserted into a nucleic acid (e.g., a gene) to induce a mutation in the nucleic acid
which produces a phenotypic alteration. The location of the nucleic acid is identified by the presence of the Minos
transposon sequence. The Minos transposon sequence can be identified, for example, using standard molecular
hybridization techniques, such as in situ hybridization, Southern blotting, and colony hybridization. The terms
"transposable element" and "transposon" are used interchangeably herein.

[0011] In a particular embodiment, this aspect of the invention relates to methods for inducing a mutation of interest
in a cell (a Minos transposon-induced mutation), and identifying and isolating a gene of interest which includes the
mutation from the cell. The methods involve the use of an isolated transposable element of the type described above
which is introduced into a cell in the presence of a transposase protein, or a nucleic acid sequence or a virus encoding
the transposase protein. In a particular embodiment, the transposable element is modified to include a promoter
operably linked to an indicator gene (such as a reporter gene or a selectable marker gene) flanked by the inverted terminal
repeats of the isolated transposable element. In a further embodiment, expression of the indicator gene is detected,
thereby identifying cells in which the transposable element has integrated into the genome of the cells. Cells which have
a mutation of interest can then be identified and selected by looking for a particular phenotype conferred by the mutation
but not the corresponding endogenous gene. These cells are referred to herein as cells including a Minos
transposon-induced (or Minos transposable element-induced) mutation. The location of the gene which includes the mutation can
then be identified by the presence of the Minos transposon sequence and then isolated.

[0012] In a second embodiment, this aspect of the invention relates to methods for selecting an insertional mutation
in a gene (a Minos transposon-induced mutation). The methods comprise introducing a transposable element of the
type described above, modified to include a minimal promoter or a splice acceptor site operably linked to an indicator
gene flanked by the inverted terminal repeats of the isolated transposable element, into a population of cells in the
presence of a transposase protein, or a nucleic acid sequence or a virus encoding the transposase protein. Expression of
the indicator gene is detected, thereby identifying cells in which the transposable element has integrated near or within
a particular gene in the cells. These cells are also referred to herein as cells including a Minos transposon-induced (or
Minos transposable element-induced) mutation. The gene near or within which the transposable element has
integrated is located by the presence of the Minos transposon sequence and then isolated.

[0013] In an eighth aspect, the invention relates to methods for reversing a Minos transposon-induced mutation in a
cell. The methods comprise introducing a transposase protein, or a nucleic acid sequence or a virus encoding a
transposase protein, into cells identified as including a Minos transposon-induced mutation, as described herein. The
transposase protein catalyzes reversion of the Minos transposon-induced mutation. Cells in which reversion of the mutation
has occurred can be identified, for example, by looking for loss of a particular phenotype conferred by the mutation or
for absence of the product encoded by the indicator gene.

[0014] In a ninth aspect, the invention relates to a method for introducing a reversible mutation in a gene of interest
in a cell. In a particular embodiment, this method involves the use of a Minos transposable element modified to include
a promoter operably linked to an indicator gene flanked by the inverted terminal repeats of the isolated transposable
element. In a second embodiment, this method involves the use of a Minos transposable element modified to include a
minimal promoter operably linked to an indicator gene flanked by the inverted terminal repeats of the isolated
transposable element. In a third embodiment, the method involves the use of a Minos transposable element modified to include
a splice acceptor site operably linked to an indicator gene flanked by the inverted terminal repeats of the isolated
transposable element. The modified transposable element is introduced into a gene of interest, thereby producing a mutated
gene. The mutated gene is introduced into a sample of cells under conditions sufficient for homologous recombination
between the mutated gene and the endogenous gene. Thus, in this aspect of the invention, the gene of interest is a
gene which has sufficient sequence homology to the endogenous gene in which a reversible mutation is to be
introduced, for homologous recombination between the endogenous gene and the mutated gene. Cells in which the
endogenous gene has been replaced with the mutated gene can then be identified and selected by looking for a particular
phenotype conferred by the mutated gene, for loss of a particular phenotype conferred by the endogenous gene or for
presence of a product encoded by the indicator gene. The mutation in the gene introduced in accordance with the
present method can be reversed by a method comprising introducing a transposase protein, or a nucleic acid sequence
or a virus encoding a transposase protein, into cells in which the endogenous gene has been replaced with the mutated
gene. Cells in which reversion of the mutation has occurred can be identified, for example, by looking for loss of a
particular phenotype conferred by the mutated gene, for a particular phenotype conferred by the endogenous gene or for
absence of the product encoded by the indicator gene.

[0015] In a tenth aspect, the invention relates to a method for inducing loss of a nucleic acid sequence of interest
integrated into the chromosome of a cell. In a particular embodiment, the nucleic acid sequence of interest refers to a gene
of interest. The method comprises introducing a transposase protein, or a nucleic acid sequence or a virus encoding a
transposase protein, into cells identified as including a nucleic acid sequence of interest which was integrated into the
chromosome of a cell using an isolated Minos transposable element of the type described above. Cells in which loss of
the nucleic acid sequence of interest has occurred can be identified, for example, by looking for loss of a particular
phenotype conferred by the nucleic acid sequence of interest or for absence of the product encoded by an indicator gene.

BRIEF DESCRIPTION OF THE DRAWINGS 



[0016] 

Figure 1A-1C is a diagram providing the consensus sequence of elements Minos-1, Minos-2 and Minos-3 with
nucleotide deletions after nucleotides 365, 678 and 715. The terminal inverted repeats and the intron sequence are
shown in small letters. Differences between the three elements are indicated above and below the nucleotide
sequence. More specifically, nucleotide 896 is a G in Minos-2 and Minos-3 and an A in Minos-1. Nucleotide 1157
is a C in Minos-1 and Minos-3 and a T in Minos-2.

Figure 2A-2C is a diagram providing the consensus sequence of elements Minos-1, Minos-2 and Minos-3. The
terminal inverted repeats and the intron sequence are shown in small letters. The first and last nucleotides of the
sequence, A and T, respectively, are generated by a duplication of the chromosomal target site TA during insertion
of the element. The deduced amino acid sequence of two open reading frames is shown above the nucleotide
sequence. Differences between the three elements are indicated above and below the nucleotide sequence. More
specifically, nucleotide 900 is a G in Minos-2 and Minos-3 and an A in Minos-1. Nucleotide 1161 is a C in Minos-1
and Minos-3 and a T in Minos-2. Amino acid residue 148 is a tryptophan in Minos-2 and Minos-3 and a stop codon
in Minos-1. Amino acid residue 235 is a serine in Minos-1 and Minos-3 and a leucine in Minos-2.

Figure 3A is a diagram of the insert of the transposon plasmid pMihsCcw. ML and MR signify the left- and right-end
parts of Minos, respectively. Speckled boxes indicate the D. melanogaster Hsp70 promoter (Hsp70-P) and
terminator (Hsp70-T) sequences. Wide hatched bars indicate the Minos (M) and Medfly white (W) sequences that were
used as probes for the analysis of transformants.

Figure 3B is a diagram of the insert of the Minos helper plasmid pHSS6hsMi. The speckled box indicates the
D. melanogaster Hsp70 promoter (Hsp70-P) sequence. Salient restriction sites are shown. Exon 1 and exon 2 are also
referred to herein as open reading frame 1 (ORF1) and open reading frame 2 (ORF2), respectively. IR indicates the
right-hand terminal inverted repeat.

Figure 4 is a bar graph depicting the frequencies of transformants among G1 progeny. Bars indicate the numbers
of G1 flies from the individual cages. The sex of the G0 flies in each cage is indicated. The numbers above cages
1, 3, 25 and 33 indicate the w+ flies that were recovered from these cages.

Figure 5 is a diagram of the helper plasmid pEFVILMi. The arrow represents the transcription start site and
indicates the direction of transcription of the Minos (Mi) transposase gene which is linked to the human translation
elongation factor (EF1) promoter. The fragment containing the EF1 promoter also comprises a 943 bp intron in the
5' untranslated region which provides an intron for the transposase RNA transcript. Upstream from the EF1
promoter is the SV40 origin of replication (ori). The 3' end of the Minos transposase gene contains a polyadenylation
signal from the human granulocyte colony-stimulating factor gene (hG-CSF).

Figure 6 is a diagram of the transposon plasmid pMiLRneo. The arrowhead represents the transcription start site
and indicates the direction of transcription. The plasmid contains the neomycin (neo) resistance gene under the
control of the early SV40 promoter, flanked by two inverted repeats of the Minos transposable element (bolded
arrows).

Figure 7 is a diagram of the plasmid pMiLneo. The arrowhead represents the transcription start site and indicates
the direction of transcription. The pMiLneo plasmid is derived from pMiLRneo (Figure 6) by deletion of the right
hand repeat of Minos to generate the defective transposon "wings clipped". The remaining left hand inverted repeat
is indicated by the bolded arrow.

SEQUENCE LISTING CROSS-REFERENCE 


[0017] In portions of the Specification, the following sequence listing cross-reference is applicable: 



SEQ ID NO: 1 Nucleic acid sequence of Minos-1 with nucleotide deletions after nucleotides 365, 678 and 715.
SEQ ID NO: 2 Nucleic acid sequence of Minos-2 with nucleotide deletions after nucleotides 365, 678 and 715.
SEQ ID NO: 3 Nucleic acid sequence of Minos-3 with nucleotide deletions after nucleotides 365, 678 and 715.
SEQ ID NO: 4 Nucleic acid sequence of Minos-1.
SEQ ID NO: 5 Deduced amino acid sequence of Minos-1.
SEQ ID NO: 6 Nucleic acid sequence of Minos-2.
SEQ ID NO: 7 Deduced amino acid sequence of Minos-2.
SEQ ID NO: 8 Nucleic acid sequence of Minos-3.
SEQ ID NO: 9 Deduced amino acid sequence of Minos-3.
SEQ ID NO: 10 MVWGC
SEQ ID NO: 11 WPSQSPDL
SEQ ID NO: 12 WPSNSPDL



DETAILED DESCRIPTION OF THE INVENTION 



[0018] The invention disclosed herein is based on the initial discovery of Minos-1, an apparently defective member of
the Tc-1 family of transposable elements. This 1779-bp element is characterized by perfect inverted repeats of 255 bp
at each terminus. The sequence encodes two non-overlapping reading frames, one of which has significant similarity with
the putative transposase encoded by the transposable element Tc1 of Caenorhabditis elegans. However, the Minos-1
element, because of a stop codon within the putative transposase gene, apparently cannot encode an active
transposase.

[0019] In an effort to identify sequences related to the Minos-1 sequence, genomic DNA of D. hydei was probed with
a portion of the Minos-1 sequence under stringent hybridization conditions. As discussed in detail in the Examples
section which follows, two full-length related sequences were identified, both of which encode an active transposase.



ISOLATED NUCLEIC ACIDS AND USES THEREOF



[0020] Thus, in one aspect, the subject invention relates to an isolated transposable element which hybridizes to the
DNA sequence of Minos-1 under stringent hybridization conditions. As used herein, stringent hybridization conditions
are considered to be hybridization in a buffered solution of 0.9 M NaCl at 55 °C. In D. hydei, up to 30 copies
which hybridize to Minos are detected; thus, it is likely that a large number of variants can be isolated using these conditions.
Comparable hybridization stringency can be established at other salt concentrations and temperatures. This is
accomplished, for example, by the inclusion of organic denaturants such as formamide in the hybridization buffer. Nucleic acid
sequences which hybridize to the Minos-1 sequence under stringent hybridization conditions are referred to herein as
members of the Minos family of transposable elements. Nucleic acid sequences which hybridize to the Minos-1
sequence under stringent hybridization conditions include, for example, the Minos-2 and Minos-3 DNA sequences.
Other examples of nucleic acid sequences which hybridize to the Minos-1 sequence under stringent hybridization
conditions include Minos-1, Minos-2 and Minos-3 DNA sequences having base deletions, insertions and/or substitutions.
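The statement that comparable stringency can be reached at other salt concentrations and temperatures, for example by adding formamide, can be illustrated with a widely cited empirical approximation for the melting temperature of long DNA:DNA hybrids: Tm = 81.5 + 16.6 log10[Na+] + 0.41 (%GC) - 0.61 (% formamide) - 500/L. This formula is not part of the disclosure; it is brought in here as an assumption, and it should be read as a rough guide only. The function names and the GC content and probe length used below are hypothetical.

```python
# Illustrative sketch: estimate hybrid melting temperature and find a
# temperature of comparable stringency when formamide is added.
# The empirical formula is an assumption, not taken from the specification.
import math


def approx_tm_celsius(na_molar: float, gc_percent: float,
                      length_bp: int, formamide_percent: float = 0.0) -> float:
    """Approximate Tm (deg C) of a long DNA:DNA hybrid."""
    return (81.5
            + 16.6 * math.log10(na_molar)
            + 0.41 * gc_percent
            - 0.61 * formamide_percent
            - 500.0 / length_bp)


def equivalent_temperature(temp_c: float, formamide_from: float,
                           formamide_to: float) -> float:
    """Temperature keeping the same (Tm - T) margin at constant salt
    when the formamide percentage changes."""
    return temp_c - 0.61 * (formamide_to - formamide_from)


# Conditions from the text: 0.9 M NaCl at 55 deg C, no formamide.
# GC content (40%) and probe length (500 bp) are hypothetical values.
tm = approx_tm_celsius(0.9, 40.0, 500)
print("stringency margin (Tm - T):", round(tm - 55.0, 1))
print("equivalent temperature with 50% formamide:",
      equivalent_temperature(55.0, 0.0, 50.0))
```

Under this approximation each percent of formamide lowers the melting temperature by about 0.61 °C, so the same margin below Tm is reached at a correspondingly lower hybridization temperature.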
[0021] The term transposable element, as used herein, refers to a DNA sequence whose excision from/insertion into
genomic DNA is catalyzed by a functional transposase protein encoded by a non-defective member of the Minos family
of transposable elements. A member of the Minos family which encodes a functional transposase and possesses other
necessary cis-acting elements (e.g., inverted terminal repeats) falls within this definition. In addition, a transposable
element which encodes a defective transposase (e.g., Minos-1 itself) falls within this definition. As discussed in greater
detail below, such defective transposable elements can be used in conjunction with a helper element (e.g., a member
of the Minos family which encodes a functional transposase) to introduce a nucleic acid sequence of interest into a cell
(e.g., a eukaryotic cell such as an animal, plant or yeast cell or a prokaryotic cell such as a bacterial cell).

[0022] The invention also relates to an isolated DNA sequence encoding a functional transposase protein, or a portion
of a transposase protein, encoded by a member of the Minos family. Such a DNA sequence need not retain the ability
to transpose in the presence of the encoded transposase protein. A sequence encoding a functional transposase
protein can be used to prepare an expression construct which can be used to produce the transposase protein by
recombinant DNA methodology. Such a recombinant protein can be over-produced in a eukaryotic (e.g., yeast) or prokaryotic
host cell (e.g., E. coli), and subsequently purified by conventional methods.

[0023] The active transposase can be used in a variety of ways. For example, as discussed below, the transposase
can be co-introduced into a eukaryotic cell with a modified transposon carrying a nucleic acid sequence of interest to
catalyze the insertion of the modified transposon into the genomic DNA of the eukaryotic cell. This is an alternative to
the co-introduction of a helper construct in eukaryotic cells which do not constitutively produce the Minos transposase.
[0024] In addition, the transposase, or portions thereof, can be used to produce antibodies (monoclonal and
polyclonal) reactive with the transposase protein. Methods for the production of monoclonal and polyclonal antibodies are
straightforward once a purified antigen is available.

[0025] Through the isolation and DNA sequence analysis of additional members of the Minos family, refinement of
the consensus sequence of Figure 2A-2C is possible. This refined consensus sequence can be used to predict
modifications of the transposase protein which will affect the specific activity of the transposase. Such predictions are easily
tested by modifying the DNA sequence of an expression construct encoding the transposase by site-directed
mutagenesis to either bring the sequence into a greater degree of conformance with the consensus sequence, or a lesser
degree of conformance with the consensus sequence. The effect of such changes on the activity of the transposase
protein is monitored by assessing the effect of the mutation on transposition frequency catalyzed by the recombinant
transposase.

METHODS FOR THE INTRODUCTION OF NUCLEIC ACID SEQUENCES INTO A CELL

[0026] Transposable elements of the Minos family, and the active transposase encoded by such elements, are useful
in methods for introducing a nucleic acid sequence of interest into a cell (e.g., a eukaryotic cell, such as an animal, plant
or yeast cell or a prokaryotic cell, such as a bacterial cell). Typically, the nucleic acid sequence of interest will be a gene
which encodes a protein. Such a gene can be placed under the regulatory control of a promoter which can be induced
or repressed, thereby offering a greater degree of control with respect to the level of the protein in the cell. As used
herein, the term "promoter" refers to a sequence of DNA, usually upstream (5') of the coding region of a structural gene,
which controls the expression of the coding region by providing recognition and binding sites for RNA polymerase and
other factors which may be required for initiation of transcription. The selection of the promoter will depend upon the
nucleic acid sequence of interest. In addition to a nucleic acid sequence encoding a protein, any other nucleic acid
sequence can be introduced by this method including, for example, regulatory sequences.

[0027] Nucleic acid sequences of interest are defined herein as heteropolymers of nucleic acid molecules. The
nucleic acid molecules can be double stranded or single stranded and can be a deoxyribonucleotide (DNA) molecule,
such as cDNA or genomic DNA, or a ribonucleotide (RNA) molecule. As such, the nucleic acid sequence of interest
can, for example, include one or more exons, with or without, as appropriate, introns, as well as one or more of the
following optional sequences, in a functional relationship: regulatory sequences (such as promoter sequences), signal or
leader sequences, splice donor sites, splice acceptor sites, introns, 5' and 3' untranslated regions, polyadenylation
sequences, and negative and/or positive selective markers.

[0028] In one example, the nucleic acid molecule contains a single open reading frame which encodes a protein. The
nucleic acid of interest is operably linked to a suitable promoter. Optionally, the nucleic acid sequence can be operably
linked to a reporter molecule.

[0029] The term "operably linked", as used herein, is defined to mean that the nucleotide sequences are linked to a 
regulatory sequence in a manner which allows expression of the nucleic acid sequence. In general, operably linked 
means contiguous. 

[0030] Suitable promoters for use in prokaryotic and eukaryotic cells are well known in the art. Exemplary promoters
include the SV40 and human elongation factor (EF1) promoters. Other suitable promoters are readily available in the art (see,
e.g., Ausubel et al., Current Protocols in Molecular Biology, John Wiley & Sons, Inc., New York (1998); and Sambrook
et al., Molecular Cloning: A Laboratory Manual, 2nd edition, Cold Spring Harbor University Press, New York (1989)).
[0031] Suitable promoters for use in plants are also well known in the art. For example, constitutive promoters for plant
gene expression include the octopine synthase, nopaline synthase, or mannopine synthase promoters from
Agrobacterium, the cauliflower mosaic virus (35S) promoter, the figwort mosaic virus (FMV) promoter, and the tobacco
mosaic virus (TMV) promoter. Specific examples of regulated promoters in plants include the low temperature Kin1 and
cor6.6 promoters (Wang, et al., Plant Mol. Biol., 23:605 (1995); and Wang, et al., Plant Mol. Biol., 28:619-634 (1995)),
the ABA inducible promoter (Marcotte et al., Plant Cell, 1:969-976 (1989)), heat shock promoters, and the cold
inducible promoter from B. napus (White et al., Plant Physiol., 106:917 (1994)). Other suitable promoters are readily
available in the art.

[0032] The term "reporter gene", as used herein, refers to a nucleic acid sequence whose product can be easily
assayed, for example, colorimetrically as an enzymatic reaction product, such as the lacZ gene which encodes
β-galactosidase. The reporter gene can be operably linked to a suitable promoter which is optionally linked to a nucleic
acid sequence of interest so that expression of the reporter gene can be used to assay integration of the transposon
into the genome of a cell and thereby integration of the nucleic acid sequence of interest into the genome of the cell.
Examples of widely-used reporter molecules include enzymes such as β-galactosidase, β-glucuronidase and
β-glucosidase; luminescent molecules such as green fluorescent protein and firefly luciferase; and auxotrophic markers such as
His3p and Ura3p. (See, e.g., Chapter 9 in Ausubel, F.M., et al., Current Protocols in Molecular Biology, John Wiley &
Sons, Inc., New York (1998)).

[0033] The generation of nucleic acid sequences and detection of reporter genes are standard molecular biological 
procedures well known in the art. Alternative combinations or modifications of the elements according to the present 
invention would be apparent to the person of skill in the art. 

[0034] The nucleic acid sequences of interest can be isolated from nature, modified from native sequences or
manufactured de novo, as described in, for example, Ausubel et al., Current Protocols in Molecular Biology, John Wiley &
Sons, New York (1998); and Sambrook et al., Molecular Cloning: A Laboratory Manual, 2nd edition, Cold Spring Harbor
University Press, New York (1989). The nucleic acids can be isolated and fused together by methods known in the art,
such as exploiting and manufacturing compatible cloning or restriction sites.
[0035] The term "integrated", as used herein, refers to the insertion of a nucleic acid sequence (e.g., a DNA or RNA
sequence) into the genome of a cell or virus as a region which is covalently linked on either side to the native sequences
of the cell.

[0036] As used herein, a cell refers to a eukaryotic or prokaryotic cell. Typically, the eukaryotic cell is of animal or plant
origin and can be a stem cell or somatic cell. The eukaryotic cell can also be a yeast cell, such as, for example,
Saccharomyces cerevisiae. Suitable animal cells can be of, for example, invertebrate, mammalian or avian origin.
Examples of mammalian cells include human (such as HeLa cells), bovine, ovine, porcine, murine (such as embryonic stem
cells), rabbit, and monkey (such as COS1 cells) cells. The cell can be a fertilized egg cell, an embryonic cell, bone
marrow stem cell or other progenitor cell. Where the cell is a somatic cell, the cell can be, for example, an epithelial cell,
fibroblast, smooth muscle cell, blood cell (including a hematopoietic cell, red blood cell, T-cell, B-cell, etc.), tumor cell,
cardiac muscle cell, macrophage, dendritic cell, neuronal cell (e.g., a glial cell or astrocyte), or pathogen-infected cell
(e.g., those infected by bacteria, viruses, virusoids, parasites, or prions).

[0037] Typically, cells isolated from a specific tissue (such as epithelium, fibroblast or hematopoietic cells) are
categorized as a cell-type. The cells can be obtained commercially or from a depository or obtained directly from an animal,
such as by biopsy. Alternatively, the cell need not be isolated at all from the animal where, for example, it is desirable to
deliver the vector to the animal in gene therapy.

[0038] The Minos transposable elements can be used to introduce a nucleic acid sequence of interest into the cells 
of invertebrates. For example, the Minos transposable elements can be used to introduce a DNA sequence of interest 
into the cells of arthropods. Arthropods include, for example, crustaceans, arachnids, myriapods and insects. 
[0039] The Minos transposable elements can be used to introduce a nucleic acid sequence of interest into either germ
line or somatic cells. The introduction of nucleic acid into germ line cells has the significant advantage that the nucleic
acid sequence of interest will be contained in all cells of the mature progeny of the organism and transmitted to its
progeny.

[0040] The Minos transposable element has been demonstrated to function in a species which is separated from the
Minos source species by an evolutionary distance of 600 million years. The Minos transposable element represents the
first demonstration of a mobile element which can function autonomously in the germ line of eukaryotes separated by
an evolutionary distance of over 100 million years and is likely to lead to the development of a long-sought
transformation system applicable across taxonomic barriers (Loukeris et al., Science 270:2002-2005 (1995)).
[0041] However, even within the dipteran order, significant applications for the Minos element exist. Listed
below are examples of a variety of plant and animal pests, and human disease vectors, which fall within the dipteran
order.



AGRICULTURAL PESTS          COMMON NAME
Ceratitis capitata          Medfly
Anastrepha species          Caribbean fruit fly
Bactrocera oleae            Dacus
Bactrocera species          Oriental fruit fly

ANIMAL PESTS                COMMON NAME
Cochliomyia hominivorax     Screw Worm Fly
Lucilia cuprina             Sheep blowfly
Simulium species            Black fly

HUMAN DISEASE VECTORS       COMMON NAME
Anopheles species           mosquito
Aedes species               mosquito
Musca domestica             housefly



[0042] Methods currently employed to control the populations of certain members of the dipteran order include the
release of sterile males. An example of the utility of the germ line transformation methods of this invention is the
improvement of the existing release method. The methods of this invention can be used to improve such methods by
enabling sexing schemes and by developing strains with desired characteristics (e.g., improved viability in the field),
conditional lethal genes for improved safety, and visible or molecular genetic markers for monitoring. Genetic sexing,
i.e., the capability of selectively killing the females (or transforming them into males) in mass-rearing facilities, is
currently recognized as an important need. Rearing and releasing only males has several advantages, including lower
breeding cost and the avoidance of population explosions due to inadvertent release of non-sterilized insects.

[0043] For example, the Mediterranean fruit fly (Medfly) Ceratitis (C.) capitata is a major agricultural pest for many
fruit species that is geographically widespread in tropical and temperate regions. The Medfly has been introduced
relatively recently into the New World, and appears to be spreading rapidly, threatening fruit producing areas in North
America (Carey, J.R., Science 253: 1369 (1991)). Since the mid-1970s, the sterile insect technique has been used
successfully for Medfly eradication and control. This method relies on the decrease in or collapse of fly populations
following releases of large numbers of sterile insects over infested areas, and offers an environmentally attractive
alternative to massive spraying with insecticides (Knipling, E.F., Science 130: 902 (1959)). The germ line transformation
methods of this invention can be used to improve the sterile insect technique by, for example, enabling sexing schemes. The
germ line transformation methods of this invention can also be used for developing Medfly strains with desired visible
markers that can be used for monitoring effective population control.
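
The population decline that the sterile insect technique relies on can be illustrated with a minimal sketch. The model below is an assumption made for illustration only and is not taken from this disclosure or from the cited references: generations are discrete, mating is random, released sterile insects compete equally with wild ones, and the function name, parameters and numbers are all hypothetical.

```python
def sterile_release_trajectory(wild, sterile_per_generation, growth_rate, generations):
    """Wild population size per generation under constant sterile releases.

    Toy assumptions (not from the source text): discrete generations, random
    mating, and sterile insects that compete equally with wild ones, so the
    fertile fraction of matings is wild / (wild + sterile).
    """
    current = float(wild)
    sizes = [current]
    for _ in range(generations):
        total = current + sterile_per_generation
        fertile_fraction = current / total if total > 0 else 0.0
        # Only fertile matings contribute to the next generation.
        current = growth_rate * current * fertile_fraction
        sizes.append(current)
    return sizes


if __name__ == "__main__":
    # Hypothetical numbers: 1 million wild flies, 9 million sterile flies
    # released each generation, fivefold growth per generation if unchecked.
    for generation, size in enumerate(sterile_release_trajectory(1_000_000, 9_000_000, 5.0, 4)):
        print(generation, round(size))
```

In this toy model the wild population shrinks only while the number released exceeds (growth_rate - 1) times the current wild population, which is one way to see why large releases are needed.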

[0044] The methods are also useful for insects for which it might be desirable to introduce new traits in the genetic
pool, rather than controlling the population levels. For example, the presence of several sympatric sub-species of
Anopheles gambiae, all of which transmit malaria, makes it highly unlikely that population control with biological
methods such as the sterile insect technique will work. An alternative scheme might involve spreading genes for
refractoriness to parasite infection into the existing populations of Anopheles through the use of transposable elements.
Population dynamics simulations indicate that this can be effected by releasing relatively small numbers of individuals
carrying an autonomously transposing element.
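
As a rough illustration of why a small release might suffice, the following sketch iterates a one-locus recursion in which heterozygous carriers transmit the element to more than half of their gametes. This is a simplified assumption and not the simulations referred to above: it assumes random mating, no fitness cost, and a single hypothetical parameter, transmission_bias, standing in for the effect of transposition.

```python
def element_frequency(initial_frequency, transmission_bias, generations):
    """Frequency of an element per generation under biased transmission.

    Toy assumptions (not from the source text): random mating, no fitness
    cost, and heterozygotes passing the element to a fraction
    (1 + transmission_bias) / 2 of their gametes. Under these assumptions
    p' = p**2 + 2*p*(1 - p)*(1 + bias)/2, which simplifies to
    p' = p + bias * p * (1 - p).
    """
    p = initial_frequency
    frequencies = [p]
    for _ in range(generations):
        p = p + transmission_bias * p * (1.0 - p)
        frequencies.append(p)
    return frequencies


if __name__ == "__main__":
    # Hypothetical release: carriers make up 1% of the gene pool.
    for generation, p in enumerate(element_frequency(0.01, 0.5, 20)):
        print(generation, round(p, 4))
```

With transmission_bias set to 0.0 the frequency stays at its starting value, which corresponds to ordinary Mendelian inheritance of a neutral insertion.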

[0045] Methods for the introduction of the Minos transposon into germ line cells of diptera are analogous to those
previously used in connection with other transposable elements (see, e.g., Drosophila, A Laboratory Handbook,
Ashburner, M., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York, (1989)). Briefly, the most common
approach is to employ a carrier/helper transposon system. The carrier transposon is a Minos transposon which has
been modified by the insertion of a DNA sequence of interest in the region of the transposon flanked by the inverted
terminal repeats. Typically, sequences relating to the transposase function are deleted in order to accommodate the
nucleic acid of interest. The helper transposon is a Minos transposable element which encodes an active transposase.
The transposase catalyzes the transposition of the carrier transposon into the genomic DNA of the germ line of
eukaryotic cells. Typically, the helper and carrier are microinjected into the posterior pole of pre-blastoderm embryos,
where the precursor cells of the germ line develop.

[0046] An alternative to the helper/carrier system involves the purification of active transposase (for example, from an
E. coli culture transformed with a recombinant construct encoding the Minos transposase). The purified transposase
can be co-introduced into appropriately selected cells along with a carrier transposon to effect integration of the carrier
into the recipient genome.

[0047] It has now been demonstrated that a nucleic acid sequence of interest can be introduced into a mammalian
cell using the Minos transposable elements described herein. Thus, the compositions and methods of the present
invention are also useful for the introduction of a nucleic acid sequence of interest into mammalian cells (e.g.,
mammalian somatic cells or mammalian germ line cells (sperm and egg cells)). This can be accomplished by inserting an
isolated transposable element of the type described herein, modified to include the nucleic acid sequence of interest
flanked by the termini of the isolated transposable element, into a nucleic acid vector, e.g., a DNA vector, such as a
plasmid, virus or other suitable replicon (e.g., a viral vector), which can be present in a single copy or multiple copies.
The vector can be introduced into a cell in the presence of a transposase protein or a DNA sequence encoding a
transposase protein (e.g., helper plasmid) by a method appropriate to the type of cell (e.g., transformation, transfection).
The transposase protein catalyzes the transposition of the modified transposable element containing the nucleic acid of
interest into the genomic DNA (chromosome) of the cell. The DNA sequence encoding a transposase protein can also be
inserted into a nucleic acid vector, virus or other suitable replicon and introduced into a cell as described herein. The
modified Minos transposable element and DNA encoding the transposase protein can be incorporated into the same or
different vectors. Examples of suitable methods of transfecting or transforming cells include calcium phosphate
precipitation, electroporation, microinjection, infection, lipofection and direct uptake. Such methods are described in
more detail, for example, in Sambrook et al., Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor
University Press, New York (1989) and Ausubel et al., Current Protocols in Molecular Biology, John Wiley & Sons,
New York (1998).
[0048] As a particular example of the above approach to introducing the modified transposable element and/or DNA
encoding a transposase protein (helper plasmid) into a mammalian cell, the modified transposable element and/or
helper plasmid can be integrated into the genome of a virus that enters the cell. The virus is then introduced into the
cell in the presence of a transposase protein, or a DNA sequence or virus encoding a transposase protein (helper
plasmid). The modified transposable element and helper plasmid can be incorporated into the same or different viral
vectors.
[0049] Viral vectors include retrovirus, adenovirus, parvovirus (e.g., adeno-associated viruses), coronavirus, negative
strand RNA viruses such as orthomyxovirus (e.g., influenza virus), rhabdovirus (e.g., rabies and vesicular stomatitis
virus), paramyxovirus (e.g., measles and Sendai), positive strand RNA viruses such as picornavirus and alphavirus, and
double stranded DNA viruses including adenovirus, herpesvirus (e.g., Herpes Simplex virus types 1 and 2, Epstein-Barr
virus, cytomegalovirus), and poxvirus (e.g., vaccinia, fowlpox and canarypox). Other viruses include Norwalk virus,
togavirus, flavivirus, reoviruses, papovavirus, hepadnavirus, and hepatitis virus, for example. Examples of retroviruses
include: avian leukosis-sarcoma, mammalian C-type, B-type viruses, D-type viruses, HTLV-BLV group, lentivirus,
spumavirus (Coffin, J.M., Retroviridae: The viruses and their replication, In Fundamental Virology, Third Edition, B.N.
Fields, et al., Eds., Lippincott-Raven Publishers, Philadelphia, 1996). Other examples include murine leukemia viruses,
murine sarcoma viruses, mouse mammary tumor virus, bovine leukemia virus, feline leukemia virus, feline sarcoma
virus, avian leukemia virus, human T-cell leukemia virus, baboon endogenous virus, Gibbon ape leukemia virus, Mason
Pfizer monkey virus, simian immunodeficiency virus, simian sarcoma virus, Rous sarcoma virus, lentiviruses and
baculoviruses.

[0050] A modified transposable element containing the nucleic acid sequence of interest can also be introduced into
a cell by targeting the modified transposable element to cell membrane phospholipids. For example, targeting of a
modified transposable element of the type described herein can be accomplished by linking the molecule to a VSV-G
protein, a viral protein with affinity for all cell membrane phospholipids. Such a construct can be produced using methods
well known to those practiced in the art.

[0051] A modified transposable element, as described herein, can also be introduced into a cell in a liposome
preparation or in another appropriate vehicle. The liposome preparation can be comprised of any liposomes which penetrate
the cell surface and fuse with the cell membrane, resulting in delivery of the contents of the liposome into the cell. For
example, liposomes such as those described in Yarosh, U.S. Patent No. 5,077,211; Redziniak et al., U.S. Patent No.
4,621,023; and Redziniak et al., U.S. Patent No. 4,508,703 can be used. The teachings of these patents are
incorporated herein by reference.

[0052] In a particular embodiment, the Minos transposon-based method can be used to produce transgenic animals.
The term "transgenic animal" is a term of art which refers to the introduction of foreign nucleic acid sequences into the
germline of an animal by, for example, introduction of the additional foreign genetic material to a gamete such as the
egg. As used herein, the term "foreign nucleic acid sequence" refers to genetic material obtained from a source other
than the parental germplasm. As used herein, the term "foreign nucleic acid sequence" also includes genetic material
obtained from the parental organism itself. Preferably, the transgenic animals are derived from mammalian embryos.
The term "mammalian", as defined herein, refers to any vertebrate animal, including monotremes, marsupials and
placentals, that suckle their young and either give birth to living young (eutherian or placental mammals) or are egg-laying
(metatherian or nonplacental mammals). Examples of mammalian species include primates (e.g., monkeys,
chimpanzees), rodents (e.g., rats, mice, guinea pigs) and ruminants (e.g., cows, pigs, horses).

[0053] Methods for acquiring, culturing, maintaining and introducing foreign nucleic acid sequences into recipient
eggs for transgenic animal production are well known in the art. See, for example, Manipulating the Mouse Embryo: A
Laboratory Manual, Hogan et al., Cold Spring Harbor Laboratory (1986). Preferably, the nucleic acid sequence of
interest (e.g., foreign nucleic acid) will be delivered by the Minos-based transposon system into the embryo at a very early
stage in development so that only a small frequency of the embryos are mosaic (e.g., an embryo in which integration of
the foreign nucleic acid occurs after the one cell stage of development).

[0054] A transposon-based method for producing transgenic animals or for stable transfection of cells in vitro has very
important advantages compared to the methodology presently used. For example, stable integration of nucleic acids
into the germ line of several mammals is now routinely achieved by micro-injecting linear DNA molecules into the
nucleus of early embryos. Some of the animals that develop from injected embryos are mosaics for integration events
and in only a fraction of these the germ line is involved. Moreover, most events consist of integration of tandem repeats
of the injected DNA; single-insertion events do occur at higher frequencies relative to tandem insertions if DNA is
injected at lower concentrations, but at a considerable cost in time and expense because the overall transformation
frequencies drop.

[0055] Using a defined transposon-transposase system can overcome some or all of these problems. First, as in
Drosophila, it may not be necessary to inject the DNA into the nucleus. A mixture of transposon plus helper
plasmids (or transposon plus purified transposase) that is active when introduced into the cytoplasm would enable
replacement of costly and time-consuming micro-injection techniques with other methods, such as use of liposomes or
viruses. Second, by controlling the relative transposon/transposase levels, the overall efficiency can be improved, with
a parallel increase of the frequency of single-insertion events.
[0056] The compositions and methods of the present invention are also useful for the introduction of a nucleic acid
sequence of interest into a plant cell to produce transgenic plants. As used herein, the term "transgenic plant" refers to
the introduction of foreign nucleic acid sequences into the nuclear, mitochondrial or plastid genome of a plant. As used
herein, the term "plant" is defined as a unicellular or multicellular organism capable of photosynthesis. This includes the
prokaryotic and eukaryotic algae (including cyanophyta and blue-green algae), eukaryotic photosynthetic protists,
non-vascular and vascular multicellular photosynthetic organisms, including angiosperms (monocots and dicots),
gymnosperms, spore-bearing and vegetatively-reproducing plants. Also included are unicellular and multicellular fungi.
[0057] Production of a transgenic plant can be accomplished by modifying an isolated transposable element of the
type described herein to include the nucleic acid sequence of interest flanked by the termini of the isolated transposable
element. The modified transposable element can be introduced into a plant cell in the presence of a transposase
protein or a nucleic acid sequence or a virus encoding a transposase protein (e.g., helper plasmid) using techniques well
known in the art. Exemplary techniques are discussed in detail in Gelvin et al., "Plant Molecular Biology Manual", 2nd
Ed., Kluwer Academic Publishers, Boston (1995), the teachings of which are incorporated herein by reference. The
transposase protein catalyzes the transposition of the modified transposable element containing the nucleic acid
sequence of interest into the genomic DNA of the plant.

[0058] For example, for grasses such as maize, the elements of the transposon-based method can be introduced into
a cell using, for example, microprojectile bombardment (see, e.g., Sanford, J.C., et al., U.S. Patent No. 5,100,792
(1992)). In this approach, the elements of the transposon-based method are coated onto small particles which are then
introduced into the targeted tissue (cells) via high velocity ballistic penetration. The transformed cells are then cultivated
under conditions appropriate for the regeneration of plants, resulting in production of transgenic plants. Transgenic
plants carrying a nucleic acid sequence of interest are examined for the desired phenotype using a variety of methods
including, but not limited to, an appropriate phenotypic marker, such as antibiotic resistance or herbicide resistance, or
visual observation of the time of floral induction compared to naturally-occurring plants.

[0059] A modified transposable element, as described herein, can also be introduced into a plant cell by
Agrobacterium-mediated transformation (see, e.g., Smith, R.H., et al., U.S. Patent No. 5,164,310 (1992)) or
electroporation (see, e.g., Calvin, N., U.S. Patent No. 5,098,843 (1992)), or by using laser beams (see, e.g., Kasuya, T.
et al., U.S. Patent No. 5,013,660 (1991)) or agents such as polyethylene glycol (see, e.g., Golds, T. et al., Biotechnology,
11:95-97 (1993)), and the like. A modified transposable element, as described herein, can also be inserted into a
nucleic acid vector (e.g., an episomal vector or a Ti plasmid vector), or virus or other suitable replicon (e.g., a viral
vector), which can be present in a single copy or multiple copies. Viral vectors which can be introduced into plant cells
include cauliflower mosaic virus, figwort mosaic virus, and tobacco mosaic virus.

[0060] The vector can be introduced into a plant cell in the presence of a transposase protein, a nucleic acid sequence
encoding a transposase protein (e.g., helper plasmid) or a virus encoding a nucleic acid sequence encoding a
transposase protein using techniques well known in the art. The method of introduction of the elements of the
transposon-based system into the plant cell is not critical to this invention.

[0061] The present invention also provides vectors containing an isolated Minos transposable element and nucleic
acid sequence of interest. Suitable vectors for use in eukaryotic and prokaryotic cells are well known in the art and are
generally commercially available, or readily prepared by the skilled artisan. For example, suitable plasmids for use
include pUC119 and pBlueScript KS. Additional vectors can also be found in, for example, Ausubel et al., Current
Protocols in Molecular Biology, John Wiley & Sons, New York (1998); Sambrook et al., Molecular Cloning: A Laboratory
Manual, 2nd Ed. (1989); and Gelvin et al., supra (1995), the teachings of which are incorporated herein by reference.
[0062] The novel Minos-based stable transfection system of the present invention can be particularly useful in the
delivery of one or more nucleic acid sequences of interest (e.g., genes) or products thereof to a patient. Generally, the
nucleic acid sequence of interest is present or has been incorporated into the genome of the viral vector. The nucleic
acid sequence or the product thereof can be a therapeutic agent. Examples of therapeutic nucleic acid sequences
include RNA (e.g., ribozymes) and antisense DNA that prevents or interferes with the expression of an undesired
protein in the target cell. The nucleic acid sequence of interest can also encode a heterologous therapeutic protein. A
heterologous protein or nucleic acid sequence is one which does not exist in the virus as it is found in nature. Examples of
therapeutic proteins include antigens or immunogens such as a polyvalent vaccine, cytokines, tumor necrosis factor,
interferons, interleukins, adenosine deaminase, insulin, T-cell receptors, soluble CD4, epidermal growth factor, human
growth factor, blood factors, such as Factor VIII, Factor IX, cytochrome b, glucocerebrosidase, ApoE, ApoC, ApoAI, the
LDL receptor, negative selection markers or "suicide proteins", such as thymidine kinase (including the HSV, CMV, VZV
TK), anti-angiogenic factors, Fc receptors, plasminogen activators, such as t-PA, u-PA and streptokinase, dopamine,
MHC, tumor suppressor genes such as p53 and Rb, monoclonal antibodies or antigen binding fragments thereof, drug
resistance genes, ion channels, such as a calcium channel or a potassium channel, and adrenergic receptors.
[0063] The invention can be particularly useful for vaccine delivery. In this aspect of the invention, the antigen or
immunogen can be expressed heterologously (e.g., by recombinant insertion of a nucleic acid sequence which
encodes the antigen or immunogen (including antigenic or immunogenic fragments)) in a viral vector. Alternatively, the
antigen or immunogen can be expressed in a live attenuated, pseudotyped virus vaccine, for example. Generally, the
methods can be used to generate humoral and cellular immune responses, e.g., via expression of heterologous
pathogen-derived proteins or fragments thereof in specific target cells.

[0064] Generally, viral vectors which contain therapeutic nucleic acid sequences of interest are known in the art.
Examples include the vectors described in Anderson, et al. (U.S. Patent No. 5,399,346), Sambrook, et al., supra,
Ausubel, et al., supra, and Weiss, et al., RNA Tumor Viruses, Cold Spring Harbor, New York (1985), the contents of
which are incorporated herein by reference.

[0065] Also envisioned is the use of these viral vectors comprising a modified Minos transposable element and the
transposase or nucleic acid sequence encoding the transposase protein for in vivo and ex vivo gene therapy.

[0066] Where a target cell is contacted in vitro, the target cell incorporating the viral vector comprising a Minos
transposable element modified to include a nucleic acid of interest and the transposase or nucleic acid sequence encoding
the transposase protein can be implanted into a patient for delivery of the nucleic acid of interest or product thereof. The
"nucleic acid of interest" is meant to refer to a gene or RNA encoded by a gene for which the patient has an insufficiency
or deficiency. The "target cell" as used herein can be migratory, such as a hematopoietic cell, or non-migratory, such as
a solid tumor cell or fibroblast. Frequently, the target cell is present in a biological sample obtained from the patient (e.g.,
blood, bone marrow). After treatment (contact with the viral vector comprising the modified Minos transposable element
and transposase protein or nucleic acid sequence encoding the transposase protein), the sample is returned or
readministered to (reintroduced into) the individual according to methods known to those practiced in the art. Such a
treating procedure is sometimes referred to as ex vivo treatment. Ex vivo gene therapy has been described, for example, in
Kasid et al., Proc. Natl. Acad. Sci. USA, 87:473 (1990); Rosenberg et al., N. Engl. J. Med., 323:570 (1990); Williams et
al., Nature, 310:476 (1984); Dick et al., Cell, 42:71 (1985); Keller et al., Nature, 318:149 (1985); and Anderson, et al.,
U.S. Patent No. 5,399,346.

[0067] The modified transposable element and helper plasmid can be incorporated into the same or separate viral 
vectors. Where the modified transposable element and helper plasmid are incorporated into separate viral vectors, the
viral vector comprising the modified transposable element and the viral vector comprising the helper plasmid can be 
simultaneously or sequentially introduced into a cell. The viral vector comprising the modified Minos-transposon can be 
introduced into a cell prior to the viral vector comprising the helper plasmid. Alternatively, the viral vector comprising the 
helper plasmid can be introduced into the cell prior to the viral vector comprising the modified Minos-transposon. 
[0068] The mode of administration to a patient is preferably at the location of the target cells. As such, the
administration can be nasally (as in administering a vector expressing ADA), orally (as in an inhalant or spray, as in
administering a vector expressing the cystic fibrosis transmembrane conductance regulator (CFTR)) or by injection (as in
administering a vector expressing a suicide gene to a tumor). Other modes of administration (e.g., parenteral, mucosal,
systemic, implant or intraperitoneal) are generally known in the art. The agents can, preferably, be administered in a
pharmaceutically acceptable carrier, such as saline, sterile water, Ringer's solution, and isotonic sodium chloride solution.

[0069] Also encompassed by the present invention is the use of the Minos transposable elements and Minos trans- 
posase to induce mutations in a cell and to identify mutations of interest in a cell. Further encompassed by the present 
invention is the use of the Minos transposon elements and Minos transposase to identify genes containing mutations of 
interest in a cell. 

[0070] As used herein, the term "mutation" refers to a change or disruption in a gene which leads to a phenotype (e.g.,
physical, biochemical, clinical, molecular, enzymatic, immunological or pharmacological) different from that of the
non-mutated cell or animal. For example, a mutation in a cell which is normally round in appearance can result in a spindle
shaped cell. Similarly, a mutation in a cell which normally metabolizes a substrate can lead to an inability to metabolize
the substrate. A mutation can be silent. As used herein, a silent mutation includes changes which occur in the genetic
material of a cell or animal but which do not distinguish it from the nonmutant cell or animal on the basis of phenotype.

[0071] A gene responsible for a mutation of interest can be identified by introducing a Minos transposon, modified to
include an indicator (e.g., a reporter or selectable marker gene) flanked by the termini of the isolated transposon, into
a collection of cells in the presence of a transposase protein or a nucleic acid sequence encoding a transposase protein
under conditions suitable for integration into the genome of a cell using techniques well known in the art. The term
"indicator", as used herein, refers to a means to determine whether the transposon has integrated into the genetic material
(e.g., chromosome) of a cell. For example, an indicator includes a reporter (e.g., lac Z) or selectable marker (e.g.,
neomycin resistance) gene product. In a particular embodiment, the modified transposon and transposase are transfected
into the collection of cells using viral vector mediated transfection schemes. Following transfection, integration can be
detected. For example, in a particular embodiment, reporter gene expression can be induced under appropriate conditions
and cells which have integrated the Minos transposon into their genome can be identified. Experimental conditions for the
detection of reporter genes are well known in the art.

[0072] Integration of the Minos transposon can induce a mutation of interest in the genome of a cell. A cell in which
the mutation of interest is present is identified by a change in phenotype and clonally propagated using standard
techniques well known in the art. The phenotypic change will depend on the cell type and will be readily apparent to one of
skill in the art. The gene of interest is then identified, cloned and analyzed by the presence of the Minos transposon
sequence using standard molecular hybridization techniques, such as in situ hybridization, Southern blotting, and
colony hybridization, employing the sequence (e.g., the entire sequence or a fragment thereof) of the Minos transposon
element as a probe using art-recognized methods (see, e.g., Ausubel et al., Current Protocols in Molecular Biology,
John Wiley & Sons, New York (1998); and Sambrook et al., Molecular Cloning: A Laboratory Manual, 2nd edition, Cold
Spring Harbor University Press, New York (1989)).

[0073] A unique advantage of the transposon system described herein compared to other mutagenesis systems is
the potential to reverse the mutation of interest in a cell by introduction of the transposase protein or a nucleic
acid sequence encoding the transposase protein. Therefore, after cloning of cells with a mutation of interest and cloning
of the gene responsible for the mutation of interest, some of the clonal cells containing the mutation of interest can be
subsequently used to confirm that a particular mutant phenotype is the result of integration of the modified Minos
transposon. This is accomplished by the introduction into the cells of a Minos transposase protein, or a nucleic acid
sequence or virus encoding a Minos transposase protein. The transposase catalyzes a site-specific excision of the
transposon dictated by the inverted terminal repeats at the 5' and 3' ends of the transposon, leaving behind a
characteristic six base pair segment consisting of the four terminal nucleotides of either side of the transposon and the TA
target site at the location of integration in the genome of the cell (Arcà, B. et al., Genetics, 145:267-279 (1997)). The
transposase excises the Minos transposon while repairing and rejoining the chromatin, thereby reversing the mutation in
many cases (e.g., mutant phenotype to wildtype phenotype).
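
The insertion and excision steps described above can be sketched as simple string operations. This is one possible reading of the text, offered only as an illustration: it assumes that the TA target is duplicated on insertion, and that the six base pair segment left after excision is made up of four terminal nucleotides from one end of the element followed by a TA. The host sequence and the element sequence are made-up placeholders, not the real Minos sequence, and the function names are hypothetical.

```python
def insert_at_ta(host, site, element):
    """Insert `element` at the TA starting at index `site`, duplicating the TA."""
    if host[site:site + 2] != "TA":
        raise ValueError("target site must be a TA dinucleotide")
    # Result: left flank + TA + element + TA + right flank.
    return host[:site + 2] + element + host[site:]


def excise(inserted, element, end="left"):
    """Remove `element`, leaving four terminal nucleotides plus TA behind.

    Returns the repaired sequence and the six base pair segment that
    remains relative to the original, uninterrupted host sequence.
    """
    start = inserted.find("TA" + element + "TA")
    if start < 0:
        raise ValueError("element flanked by TA not found")
    terminal = element[:4] if end == "left" else element[-4:]
    left_flank = inserted[:start]
    right_flank = inserted[start + len(element) + 4:]
    footprint = terminal + "TA"
    return left_flank + "TA" + footprint + right_flank, footprint


if __name__ == "__main__":
    host = "GGCCTAGGCC"              # placeholder host sequence with one TA
    element = "CGAGAAAATTTTCTCG"     # placeholder element, not real Minos
    inserted = insert_at_ta(host, 4, element)
    repaired, footprint = excise(inserted, element)
    print(inserted)
    print(repaired, footprint)
```

Under these assumptions the repaired sequence is six base pairs longer than the original host sequence, which is the kind of signature that could be looked for when confirming an excision event.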
[0074] The random integration of the Minos-transposon can be mutagenic in a cell in vitro as discussed above. As a
result, the Minos-transposon and transposase can be used to induce mutations in a cell or animal. The
Minos-transposon and transposase can also be used, for example, in vivo or in vitro to induce reversion of a Minos-induced
mutation, as described herein. It is envisioned that the methods and compositions of the present invention can be used to
induce random excision (removal) events in the genome of somatic cells during development of an animal or
differentiation of a cell, thereby permitting the generation of mutations during development of an animal or differentiation
of a cell. The developmental consequences of the mutations can then be evaluated. Differentiation refers to acquisition or
possession of characteristics or functions differing from that of the original type. For example, a differentiated erythroblast
cell is an erythrocyte.

[0075] Using the Minos-transposon and transposase of the invention, a recombination system can be employed to
delete a gene segment with precision in a cell (e.g., embryonic stem cell). In a particular embodiment, an isolated Minos
transposable element, modified to include a nucleic acid sequence of interest (e.g., an indicator gene) flanked by the
inverted terminal repeats of the isolated transposable element, is introduced into a cell. The transposon can be introduced
either by the transposase-dependent methodology of the invention or by conventional transgenesis or by other
means such as another transposable element. The transposon and a transposase protein, or nucleic acid modified
Minos transposable element and the transposase or nucleic acid sequence encoding the transposase protein sequence
or virus encoding a transposase protein are then introduced into the cell. To facilitate identification and selection of cells
harboring the integrated transposon and nucleic acid sequence of interest from cells having undergone Minos
transposase-mediated DNA recombination (e.g., excision), tandemly linked bacterial neomycin resistance (neo) and herpes
simplex virus thymidine kinase (HSV-tk) genes are included in the modified Minos transposon vector. The neo and
HSV-tk genes will serve as positive and negative selection markers, respectively. Other suitable selection markers are known
in the art and can also be used. Selected cells are clonally propagated. Selection and cloning processes are techniques
well known and readily available in the art.
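By way of illustration only, the positive/negative selection logic described above can be sketched as a small truth table. The sketch assumes the drugs commonly paired with these markers (G418 for neo, ganciclovir for HSV-tk), which the text does not name; the function and argument names are hypothetical.

```python
# Toy truth table for positive/negative selection with a neo / HSV-tk cassette.
# Assumption (not named in the text): G418 selects for neo, ganciclovir
# selects against HSV-tk. Function and argument names are hypothetical.

def survives(has_cassette: bool, g418: bool, ganciclovir: bool) -> bool:
    """Return True if a cell survives the given drug combination."""
    if g418 and not has_cassette:
        return False  # no neo gene, so G418 kills the cell
    if ganciclovir and has_cassette:
        return False  # HSV-tk makes the cell sensitive to ganciclovir
    return True


if __name__ == "__main__":
    for has_cassette in (True, False):
        state = "transposon integrated" if has_cassette else "transposon excised"
        for drug in ("G418", "ganciclovir"):
            alive = survives(has_cassette, drug == "G418", drug == "ganciclovir")
            print(f"{state:24s} {drug:12s} -> {'survives' if alive else 'dies'}")
```

Under these assumptions, cells carrying the integrated cassette are recovered on G418, and cells that have lost it by excision are recovered on ganciclovir.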

[0076] Thus, the Minos-transposon and transposase system can be a valuable tool to mediate recombination and
reverse and induce mutations in cells and transgenic animals. This method can be efficient for the introduction of
deletions of defined regions and lengths in the genome of a cell.

[0077] To generate a transposon-induced mutation in a gene of interest in a cell, an isolated Minos transposon modified
to include an indicator is integrated into the cloned DNA sequence of the gene of interest in such a way that expression
of the gene of interest is disrupted. The term "gene of interest" is used to refer to a stretch of DNA in a cell which
carries the genetic information for an mRNA molecule and corresponding protein. For example, a gene of interest can be
the epidermal growth factor, prolactin, P-selectin or estrogen receptor gene. "Cloned" is a term of art which refers to
nucleic acid sequences manufactured by molecular biological techniques.

[0078] The introduction of the transposon into the cloned gene of interest can be achieved by conventional recombinant
DNA technologies or by the transposase-induced transposition of the transposon, in vivo or in vitro, into a suitable
plasmid containing the gene of interest. The plasmid is amplified and used in standard gene targeting protocols to
replace the endogenous gene of interest in a cell by homologous recombination/gene conversion techniques. Endogenous
means native to the cell and not derived from the cloned DNA. Homologous recombination occurs between the
cloned DNA of the gene of interest and the endogenous DNA of the gene of interest, thereby targeting the cloned gene
of interest into the chromosome. The cells containing the targeted mutation are identified and clonally propagated using
well known, routine methodologies described in detail in several art-recognized protocol texts including, for example,
Ausubel et al., supra (1998). The cells containing the targeted mutation can be evaluated experimentally. To induce
reversion of the targeted mutation in a cell, a transposase protein, or a nucleic acid encoding the transposase protein, is
introduced into a cell(s), resulting in excision of the transposon and in many instances reversion of the mutation.
Mutations that can be reversed are those in which the six-base pair footprint remaining after Minos transposon excision does
not disrupt expression of the gene.

[0079] In a preferred embodiment the cells are embryonic stem cells. The embryonic stem cells are used for gene
targeting and the resulting mutant cells are used to create transgenic animals and animals carrying null or "knock-out"
mutations. A "knock-out" mutation refers to the disruption of a gene of interest with a complete loss of function. The
embryonic stem cells which contain the gene of interest integrated into their genome by the Minos-transposon system
can be transmitted to the germline of an animal, such as a mouse, by injection into an early cleavage stage embryo
(e.g., blastocyst) or by aggregation with two morulae to produce a chimera. "Chimera" is a term of art intended to mean
an embryo containing cells or tissues with two or more genotypes. Chimeras carrying the mutated or foreign nucleic
acid sequence in their germ cells are then bred to produce transgenic offspring that are entirely derived from the
embryonic stem cells which carry the mutation. Genetic markers such as coat color in mice can be used to distinguish
chimeras and animals derived entirely from embryonic stem cells. Experimental techniques for obtaining, propagating,
cloning and injecting embryonic stem cells are well known in the art. See, for example, Evans et al., Nature, 292:154-
156 (1981); Rossant et al., Experimental Approaches to Embryonic Mammalian Development, Cambridge University
Press (1986); Sedivy et al., Gene Targeting, W.H. Freeman and Co., New York (1992); Ausubel et al., supra (1998).

[0080] The Minos-transposon approach has several advantages over the recombination techniques currently in use
such as the Cre/LoxP system. For example, the introduction of nucleic acid sequences of interest is performed directly
by the Minos transposon. No additional components, such as target sites, are required. In addition, using the present
method, a single copy of a nucleic acid sequence of interest can be integrated and precisely excised from the genetic
material of a cell in each integration step.

[0081] The invention also relates to the use of the Minos transposable element to identify gene enhancer elements.
The term "enhancer", as used herein, refers to any cis-acting nucleic acid sequence that increases or augments the
utilization of a gene promoter and can function either upstream or downstream from the promoter. An enhancer element
can be close to or distant from the promoter. In this aspect of the invention, an isolated Minos transposable element is
modified to include an indicator gene (e.g., a nucleic acid sequence encoding a suitable reporter molecule such as
β-galactosidase, or a selectable marker (e.g., neomycin resistance)) flanked by the inverted repeats of the isolated
transposable element, which is optionally linked to a minimal promoter (e.g., a TATA box sequence). As used herein, the term
"minimal promoter" includes nucleotide sequences upstream from the Minos transposon that can weakly initiate
transcription.

[0082] For enhancer detection methods, the nucleic acid sequence comprising a minimal promoter and Minos
transposable element, modified to include an indicator gene (e.g., a reporter gene such as lacZ, or a selectable marker such
as a neomycin resistance gene), is incorporated into a suitable vector, as described herein, and introduced into a population (or
sample) of cells under conditions appropriate for integration into the genome of a cell in the presence of a transposase
protein or a nucleic acid sequence or virus encoding a transposase protein. Integration into the genome of a cell at or
near an enhancer site can be detected by the indicator, for example, by selection of cells or detection of the reporter gene
product. It is expected that varying ranges of signal, in the case of a reporter gene, and selection, in the case of a
selectable marker, can occur depending upon the strength of the enhancer. Once an enhancer region has been identified
by, for example, expression of a reporter gene, the enhancer site can be located within the genome by standard
hybridization protocols (e.g., in situ hybridization and Southern blotting with Minos transposon specific probes) and the
resulting sites readily cloned and analyzed. Experimental conditions for the detection of reporter and selectable marker
genes as well as hybridization techniques and genomic sequencing are well known in the art.

[0083] The methods and compositions of the present invention can also be used to detect and trap an exon of a gene
in a cell. The term "exon", as used herein, is any segment or region of a gene which is represented in the mature mRNA
transcription product. Most eukaryotic genes and some prokaryotic genes include additional nucleic acid sequences
referred to as introns that are within the coding region of a gene but do not appear in the mature mRNA. Introns are
dispersed among the exons in the genetic material of cells. To identify an exon of interest in a gene, an isolated Minos
transposable element is modified to include an indicator gene (e.g., reporter or selectable marker gene) lacking a
translation initiation codon but linked to a splice acceptor sequence and flanked by the inverted terminal repeats of the
isolated transposable element. The modified transposable element is incorporated into an appropriate vector and
introduced into a population of cells in the presence of a transposase protein or a nucleic acid sequence encoding a
transposase protein. In a particular embodiment, the modified transposable element and/or the transposase is
incorporated into a viral vector, which is introduced into a population of cells. Random integration of the transposon into an
intron of a gene in the correct orientation can result in transcription of hybrid mRNA encoding, for example, an indicator
gene, such as a reporter or marker gene. mRNA transcribed from a gene disrupted by integration of the modified
transposon in an intron results in a change in mRNA splicing patterns compared to the gene lacking the integrated
transposon in such a way that a hybrid mRNA is produced carrying, for example, the reporter or marker gene as an exon. This
change in splicing pattern signifies the presence of an exon. Genes targeted in this way can be isolated by virtue of their
being linked to the Minos transposon. Methods for transfection, reporter gene expression, selection conditions, mRNA
isolation, reverse transcription protocols, nucleic acid sequencing and hybridization techniques are all well known
art-recognized technologies. Exemplary discussions and detailed protocols can be found in Ausubel et al., supra and
Sambrook et al., supra.

[0084] The gene- and enhancer-trapping strategies described above can provide for a novel and relatively simple
method of identifying developmentally regulated genes. For example, reporter genes lacking promoters can be
randomly integrated into the genome of a cell by using the Minos-based transposon system described herein. For example,
a modified transposon element containing a "promoter-less" reporter gene can be introduced into mouse embryonic
stem cells, which are then introduced into embryos. Screening for expression of the reporter gene permits the
identification of endogenous genes which become transcriptionally active in the developing embryo or in the embryonic stem
cells in vitro. The molecular tag provided by the Minos transposon enables developmentally expressed and regulated
genes to be readily identified, cloned and analyzed.

METHODS FOR ISOLATING ADDITIONAL Tc-1 FAMILY MEMBERS 

[0085] DNA sequence analysis of the members of the Minos family disclosed herein, and comparison of this
sequence information to the sequences of Tc-1 family members from evolutionarily distant organisms (e.g., nematode),
reveal short stretches of conserved amino acid sequence within the transposase coding region. This high degree of
conservation suggests a method for isolating Tc-1 family members from diverse eukaryotic species.

[0086] This method involves the amplification of DNA by polymerase chain reaction from a eukaryote of interest using
primers which are complementary to a sequence of at least about 12 consecutive nucleotides which encode amino
acids which are highly conserved in aligned sequences of nematode Tc-1 family members and dipteran Minos family
members. Such amino acid sequences include, for example, MVWGC (SEQ ID NO:10), WPSQSPDL (SEQ ID NO:11)
and WPSNSPDL (SEQ ID NO:12).
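By way of illustration only, a conserved peptide motif can be back-translated into a degenerate oligonucleotide to see how long the encoding sequence is and how many distinct primers the pool would contain. The sketch below uses the standard genetic code with IUPAC ambiguity symbols; as a stated simplification, serine and leucine are represented only by their four-codon families (TCN and CTN), and the function names are hypothetical.

```python
from math import prod

# IUPAC-degenerate codons for the residues in the motifs named in the text.
# Simplification (assumption): Ser is written as TCN (ignoring AGY) and
# Leu as CTN (ignoring TTR), so the pools shown are not exhaustive.
DEGENERATE_CODON = {
    "M": "ATG", "V": "GTN", "W": "TGG", "G": "GGN", "C": "TGY",
    "P": "CCN", "S": "TCN", "Q": "CAR", "N": "AAY", "D": "GAY", "L": "CTN",
}

# Number of bases each IUPAC symbol stands for.
IUPAC_SIZE = {"A": 1, "C": 1, "G": 1, "T": 1, "R": 2, "Y": 2, "N": 4}


def back_translate(peptide: str) -> str:
    """Return the degenerate coding sequence for a peptide motif."""
    return "".join(DEGENERATE_CODON[aa] for aa in peptide)


def degeneracy(oligo: str) -> int:
    """Return how many distinct sequences a degenerate oligo represents."""
    return prod(IUPAC_SIZE[base] for base in oligo)


if __name__ == "__main__":
    for motif in ("MVWGC", "WPSQSPDL", "WPSNSPDL"):
        oligo = back_translate(motif)
        print(motif, oligo, len(oligo), degeneracy(oligo))
```

Each of the three motifs encodes at least 15 nucleotides, which satisfies the "at least about 12 consecutive nucleotides" criterion; the shortest motif, MVWGC, gives the least degenerate pool.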

[0087] The present invention will now be illustrated by the following examples, which are not intended to be limiting in 
any way. 


EXAMPLES 
EXAMPLE 1 
MATERIALS AND METHODS
[0088] 

1. FLY STRAINS 

Standard procedures were used for culturing of Drosophila hydei. All strains used in this study have been used
previously for rDNA work and are named for the X and Y chromosomes. Strain bb1 (bb1/bb1 x bb1/Y) carries a
bobbed X chromosome; strain X7 (X7/X7 x X7/Y) is a subline of the Dusseldorf wild-type strain; strain X^X/Y (X^X/Y
x X/Y) females carry a compound X chromosome which has no rDNA. Strain wm1/Y (wm1/Y x X-3/Y) females have
a compound X chromosome (wm1); males carry an X-autosome 3 translocation which has no rDNA.


2. DNA MANIPULATIONS AND SEQUENCING 

All basic procedures were carried out essentially as described (Maniatis et al., 1982). DNA from adult females
of strain bb1 was partially digested with EcoRI and cloned into phage vector λgt7. To recover new Minos elements,
the library was screened by hybridization with a 1.7 kb HhaI fragment which contains most of the Minos-1
sequence. For sequencing, the appropriate restriction fragments from positive clones were subcloned into plasmid
vectors pUC8 and pUC9 and nested deletions were generated by digestion with exonuclease Bal31 followed by
subcloning. Sequencing was performed by conventional methods. Both strands were sequenced, with a minimum
of two independent sequences for each base pair.

3. SEQUENCE ANALYSIS

Database searches and sequence analysis and manipulations were performed using the programs FASTA (Pearson
and Lipman, Proc. Natl. Acad. Sci. USA 85:2444-2448 (1988)), BLAST (Altschul et al., J. Mol. Biol. 215:403-
410 (1990)) and the computer package GCG (Devereux et al., Nucl. Acids Res. 12:387-395 (1984)). The program
CLUSTAL (Higgins and Sharp, 1988) was used for protein sequence alignments.

RESULTS



1. THE SEQUENCE OF MINOS 






[0089] Three new representatives of the Minos family of transposable elements have been cloned and sequenced;
they have been named Minos-2, Minos-3 and Minos-4, Minos-1 being the element reported previously. Minos-2 and
Minos-3 are complete elements distinct from Minos-1, as judged from the restriction maps of the flanking DNA and the
flanking sequences. The sequences of the elements, summarized in Figure 2A-2C, show very little variation, differing
in only two positions. At position 900 of the sequence, Minos-2 and Minos-3 have a G instead of the A found in
Minos-1. This transition changes a TAG stop codon to TGG and restores a 603 bp ORF beginning with ATG at position 878.
The second difference is at nucleotide 1161, which is a C in Minos-1 and Minos-3 and a T in Minos-2. This causes a
Ser to Leu substitution in ORF2 of Minos-2, relative to Minos-1 and Minos-3. Minos-2 and Minos-3, therefore, have two
complete ORFs beginning with an ATG: ORF1, which can encode a 133 amino-acid peptide, and ORF2, which can
encode a 201 amino-acid peptide.
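By way of illustration only, the coordinates given above can be checked arithmetically. The sketch below locates a nucleotide within the reading frame that begins at position 878; the helper name is hypothetical and the numbers are those stated in the text.

```python
# Reading-frame arithmetic for ORF2, using coordinates stated in the text.
ORF2_START = 878   # first base of the ATG
ORF2_LENGTH = 603  # bp


def codon_position(orf_start: int, nucleotide: int):
    """Return (codon number, position within codon), both 1-based."""
    offset = nucleotide - orf_start
    return offset // 3 + 1, offset % 3 + 1


def differing_positions(codon_a: str, codon_b: str):
    """Return the 1-based positions at which two codons differ."""
    return [i + 1 for i, (a, b) in enumerate(zip(codon_a, codon_b)) if a != b]


if __name__ == "__main__":
    print(codon_position(ORF2_START, 900))    # codon and position of the A/G difference
    print(codon_position(ORF2_START, 1161))   # codon and position of the C/T difference
    print(differing_positions("TAG", "TGG"))  # which base of the stop codon changes
    print(ORF2_LENGTH // 3)                   # number of codons in 603 bp
```

Position 900 falls on the second base of a codon in this frame, which is the base that distinguishes TAG from TGG, and 603 bp corresponds to 201 codons, in agreement with the stated peptide length.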

[0090] The Minos-4 clone does not contain a complete element. The sequence of the cloned DNA fragment begins
at the EcoRI site found at position 1172 of the other members and is identical to the Minos-1 sequence to base 1779.
Apparently Minos-4 represents a partial isolate rather than a defective member of the family, since the library from
which it was isolated was from DNA cut with EcoRI.

[0091] The DNA sequences flanking the cloned elements are different from each other; this indicates that these
elements are inserted at different sites of the D. hydei genome, and are, therefore, distinct. These sequences are mainly
characterized by a high A/T content, and do not show any other obvious similarity. In all cases, the inverted repeats end
with the dinucleotide TA, which is at the same time a direct and an inverted repeat. Because of this, there is some
ambiguity in defining the ends of the element precisely. Shown below are the sequences of the Minos 1-4 insertion sites.
The rDNA sequences flanking the Minos elements are shown in lower case and Minos sequences are shown in upper
case. The rDNA sequence identical to the flanking DNA of Minos-1 has been aligned with the Minos-1 insertion
sequence. It is noted that since gapped sequences are treated as separate sequences for purposes of the Rules of
Practice in Patent Cases (37 CFR 1.822(o)), and since each of the separate sequences contains less than 10
nucleotides, the sequences shown below have not been listed in the Sequence Listing.

[0092] In the case of Minos-1, which is inserted into a region which has been previously sequenced, the external
transcribed spacer of the rDNA repeat, there are two possibilities. As shown below, deleting the sequence which begins
with ACGA and ends with TCGT would restore the rDNA sequence; the element, with an A and a T at the two ends, may
have inserted between a T and an A. In this possibility, the element would be 1779 bp long with 255 bp inverted repeats.
Alternatively, the element may begin and end with CGA..TCG and produce a target site duplication, as happens with
many other mobile elements. In this possibility the target site duplication would involve the dinucleotide TA, and the size
of the element would be 1777 bp. For numbering, the A of the TA repeat has been designated nucleotide number 1 of
the Minos-1-3 sequences.



rDNA        ataat-            -attaa
Minos-1     ataatACGA         -TCGTattaa
Minos-2     aatatACGA         -TCGTataat
Minos-3     gctttACGA         -TCGTagaag
Minos-4     tttctACGA
                |                |
                1775             1

2. MOBILITY AND HOMOGENEITY OF MINOS ELEMENTS 

[0093] The striking degree of sequence conservation among the cloned Minos elements suggests that, as in the case
of Tc1, all Minos elements may be highly homogeneous. To test this, the single HhaI site within each of the terminal
repeats of Minos was exploited. The 1.68 kb HhaI fragment of Minos-1 was used as probe in a Southern blot of genomic
DNA from the same strains, digested with CfoI, an isoschizomer of HhaI. A single, strong band of approximately 1.7 kb
was detectable in all lanes, indicating that no major deletions or rearrangements are present in the Minos elements
present in these strains.

3. COMPARISON OF THE PROTEINS ENCODED BY Tc1 AND MINOS



[0094] The deduced 201 amino acid sequence of the ORF2 in Minos-2 and Minos-3 shows significant sequence
similarity with the 201 carboxy terminal residues of TcA, the putative transposase of Tc1; alignment of the sequences gives
63 identities (31%) and 91 conservative substitutions (45%) with only two single-residue insertion-deletions. The two
sequences, however, differ in size; TcA has 72 additional amino acids at the amino end. The 50 amino-terminal
residues of TcA show weak but significant sequence similarity with the carboxy terminus of Minos ORF2; introduction of a
60-bp deletion in the Minos DNA sequence creates a long open reading frame which contains most of ORF1 (codons
1 to 118) and the entire ORF2 extended by 22 codons upstream of the ATG. Interestingly, this 60-bp sequence, from
base 752 to base 811 of the Minos sequence, exhibits features of an intron. More specifically, the 5' and 3' ends conform
to the consensus splice donor and acceptor sites and a version of the internal splice signal consensus is found 30
nucleotides upstream from the 3' end.
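By way of illustration only, the intron coordinates and alignment percentages quoted above can be cross-checked with simple arithmetic. The sketch assumes the ORF2 ATG at position 878 (as stated for the Minos sequence elsewhere in this description); the variable and function names are hypothetical.

```python
# Coordinates and counts taken from the text.
INTRON_START, INTRON_END = 752, 811   # putative intron, inclusive coordinates
ORF2_ATG = 878                        # first base of the ORF2 start codon
ALIGNED_RESIDUES = 201                # length of the ORF2 / TcA alignment


def span_length(start: int, end: int) -> int:
    """Length of an inclusive coordinate span."""
    return end - start + 1


def upstream_codons(exon_start: int, atg: int) -> int:
    """Whole codons between the start of exon 2 and the ORF2 ATG."""
    distance = atg - exon_start
    if distance % 3:
        raise ValueError("exon start is not in frame with the ATG")
    return distance // 3


if __name__ == "__main__":
    exon2_start = INTRON_END + 1
    print(span_length(INTRON_START, INTRON_END))   # intron length in bp
    print(upstream_codons(exon2_start, ORF2_ATG))  # codons added upstream of the ATG
    print(100 * 63 / ALIGNED_RESIDUES)             # percent identity
    print(100 * 91 / ALIGNED_RESIDUES)             # percent conservative substitutions
```

The span from 752 to 811 is 60 bp, a multiple of three, and the first base after it lies exactly 22 codons upstream of the ORF2 ATG, consistent with the extension described above.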

4. DIVERGENCE OF THE TcA-RELATED SEQUENCES


[0095] Although Minos inhabits a Drosophila species, it is not more closely related to the other Tc1-like elements from
Drosophila species, HB1 and Uhu. These elements, or at least the members which have been sequenced, do not
contain open reading frames comparable in length to that of Tc1. However, if small numbers of deletions and insertions are
introduced in their DNA sequences, open reading frames can be generated which show significant similarity with the
TcA sequence. Most of these insertion-deletion changes involve one nucleotide, presumably representing mutations
which have accumulated in these inactive elements. Table 1 shows a similarity matrix between the three Drosophila and
the two nematode elements, in the regions corresponding to the hypothetical Minos exon 2. In Table 1, percent identities
are shown above the diagonal; identical/total positions are shown below the diagonal. Minos shows approximately the
same degree of similarity (between 28 and 36 percent identity) with all the other elements; HB1 and Uhu show
comparable similarities. In a multiple sequence alignment of the same regions, 21 of the resulting 225 positions (9%) are
invariant and 49 positions (22%) are occupied by related amino acids. It should also be noted that the similarity between HB1
and Uhu with Tc1 and Minos extends another 18 codons upstream from the position corresponding to the first codon of
the hypothetical exon 2 of Minos. No other significant similarities can be detected between Tc1, Uhu, HB1 and Minos
in the sequences between the terminal repeats.
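By way of illustration only, the percent identities in Table 1 can be recomputed from the identical/total counts given below the diagonal. The sketch uses the counts for the pairs whose percentages are legible in the table; the dictionary layout and function name are hypothetical.

```python
# (identical positions, total positions, percent identity listed in Table 1)
TABLE_1 = {
    ("Tc1", "Minos"): (70, 221, 31),
    ("Tc1", "Uhu"): (96, 217, 44),
    ("Tc1", "HB1"): (73, 223, 33),
    ("TCb1", "Minos"): (75, 222, 34),
    ("TCb1", "Uhu"): (89, 217, 41),
    ("TCb1", "HB1"): (79, 223, 35),
    ("Minos", "Uhu"): (78, 218, 36),
    ("Minos", "HB1"): (62, 222, 28),
    ("Uhu", "HB1"): (68, 219, 31),
}


def percent_identity(identical: int, total: int) -> float:
    """Percent of aligned positions that are identical."""
    return 100.0 * identical / total


if __name__ == "__main__":
    for (a, b), (identical, total, listed) in TABLE_1.items():
        computed = percent_identity(identical, total)
        print(f"{a:5s} {b:5s} {identical}/{total} = {computed:5.1f}%  (listed: {listed}%)")
```

The recomputed values agree with the listed percentages to within one percentage point, and the four Minos comparisons fall in the stated range of roughly 28 to 36 percent.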

TABLE 1

            Tc1          TCb1         Minos        Uhu          HB1
Tc1         -            [illegible]  31           44           33
TCb1        160/223      -            34           41           35
Minos       70/221       75/222       -            36           28
Uhu         96/217       89/217       78/218       -            31
HB1         73/223       79/223       62/222       68/219       -

5. THE ORF1 SEQUENCE IS RELATED TO THE PAIRED BOX SEQUENCE 

[0096] Searches of the nucleic acid and protein sequence data libraries with the ORF1 sequence using the FASTA
and WORDSEARCH algorithms gave no significant matches. However, the Basic Local Alignment Search Tool program
revealed a similarity with the paired box sequence, a peptide sequence found in the Drosophila paired gene product,
and conserved in other Drosophila and mammalian genes. This similarity extends approximately between residues 1
to 96 of the Minos sequence, and residues 35 to 131 of the Drosophila paired protein. Alignment of the Minos sequence
with the Drosophila and human paired box sequences for maximum similarity shows 16 invariant positions in this region
(17%) and 49 positions occupied by related amino acids (51%). The corresponding values for the human and
Drosophila paired sequences are 72% identities and 23% conserved positions.

[0097] Although the Minos-paired similarity is weak compared to that between the Drosophila and human paired
sequences, it is statistically significant. The similarity score between the Minos sequence (amino acids 1 to 118 of
ORF1) and the corresponding human paired sequence (amino acids 17 to 135 of the published sequence) is
approximately 10 standard deviations higher than the average of the scores obtained from 50 comparisons made between the
Minos sequence and 50 randomly shuffled human paired sequences.
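By way of illustration only, a shuffle-based significance estimate of the kind described above can be sketched as follows. The scoring function here is a deliberately simple count of identical aligned positions, not the similarity score used in the analysis reported above, and all names, the example peptide and the random seed are hypothetical.

```python
import random
from statistics import mean, stdev


def identity_score(a: str, b: str) -> int:
    """Toy similarity score: number of identical positions (no gaps)."""
    return sum(x == y for x, y in zip(a, b))


def shuffle_z_score(query: str, target: str, n_shuffles: int = 50, seed: int = 0):
    """Compare the real score with scores against shuffled targets.

    Returns (z, shuffled_scores), where z is the number of standard
    deviations by which the real score exceeds the shuffled mean.
    Raises ZeroDivisionError if all shuffled scores are identical.
    """
    rng = random.Random(seed)
    real = identity_score(query, target)
    shuffled_scores = []
    for _ in range(n_shuffles):
        letters = list(target)
        rng.shuffle(letters)  # preserves composition, destroys order
        shuffled_scores.append(identity_score(query, "".join(letters)))
    z = (real - mean(shuffled_scores)) / stdev(shuffled_scores)
    return z, shuffled_scores


if __name__ == "__main__":
    peptide = "MKRLVSTAGHWQEDNCFYIP" * 2  # arbitrary example sequence
    z, scores = shuffle_z_score(peptide, peptide)
    print(f"z = {z:.1f} from {len(scores)} shuffles")
```

Shuffling keeps the amino acid composition of the target constant, so a high z value indicates that the similarity depends on residue order rather than on composition alone.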
6. TRANSPOSITION IN D. melanogaster

[0098] A D. melanogaster "helper" strain which can overproduce the Minos transposase upon exposure to heat shock
was constructed. The strain was constructed by introducing a modified Minos element into the germ line by
conventional P element transformation (see, e.g., Drosophila, "A Laboratory Handbook", Ashburner, M., Cold Spring Harbor
Laboratory Press, Cold Spring Harbor, New York, (1989)). To place the Minos transposase under heat shock control,
the left-hand terminal repeat of Minos-2 was replaced by the D. melanogaster hsp70 promoter. This modified element
was inserted into the P element transformation vector pDM30, which contains a wild-type copy of the Drosophila rosy
(ry) gene as a dominant visible marker. The plasmid (pPhsM2) was injected into pre-blastoderm embryos of a ry strain;
injected G0 adults were mated to ry flies and ry+ G1 progeny were bred further. Three independent transformants were
recovered, two on the third chromosome (named M46 and M67) and one on the X (M84). Southern blots using ry and
Minos probes indicated that each of the three transformants contains a single insertion of the complete sequence
between the P element ends. Northern blots of total RNA from adult transformed flies subjected to a heat shock showed
abundant transcripts hybridizing to Minos probes. No Minos-related transcripts have been detected by the same probes
in RNA from non-heat shocked flies. The structure of the RNA transcripts was investigated in another series of
experiments discussed below.

[0099] Breeding of these transformants showed that they are all homozygous lethal. This observation was
unexpected; the recovery of recessive lethal mutations due to insertional inactivation of essential genes is a rather
uncommon event in P transformation experiments. Moreover, the insertion into the X clearly has not caused a "knock-out"
mutation since hemizygous males are viable and fertile; only homozygous females are inviable. This behavior
suggested that the lethality may be dosage- or pairing-dependent, the latter being more likely because double
heterozygotes of the two insertions in the 3rd chromosome are viable. The observed lethality is a useful feature which enables
one to follow the segregation of the "helper" chromosomes by keeping them over genetically marked balancers.
[0100] Strong evidence for Minos transposition in the germ line was obtained by first introducing the M67
chromosome into a white background (y,w; TM3/M67). Pre-blastoderm embryos were injected with a plasmid (pM2w)
containing a complete Minos-2 element with a wild-type copy of the white (w) gene inserted into its unique EcoRI restriction
site within ORF2. The inserted w sequences provide a dominant selectable marker; in addition they interrupt ORF2,
making the production of active transposase from this construct highly improbable. Three separate experiments were
conducted: in experiment A, injected embryos and the developing larvae and adults were kept at 18°C; in
experiment B, they were kept at 25°C throughout development; and in experiment C, the embryos were subjected to
a 1-hour 37°C heat shock three hours after injection. All emerging G0 flies (63, 38 and 61, from experiments A, B and
C, respectively) were mated to y,w; TM3/Dgl3 flies and the progeny were scored for the appearance of the w+
phenotype. To date, at least four independent germ line transformation events have been detected in experiments A and B.
Two of these events come from a single G0 male from experiment A and at least two have been recovered from two
different G0 flies from experiment B. The results are shown in Table 2 below:

[Table 2: only fragments of the table survive in this copy: "75"; "B33", "116", "B33.1-18"]





[0101] Evidence that the Minos-w+ transposon can be mobilized in the soma of flies which produce the transposase
has been obtained. Larvae of the constitution y,w; TM3/[M2w]M67 (progeny of the A10.2 fly), which contain both
transposon and helper sequences, were subjected to heat shock and adult flies were examined for the appearance of eye
color mosaicism. More than 50% of the flies showed mosaicism of different degrees. Patches of ommatidia with either
reduced or increased pigmentation were observed, which is consistent with the expected result of a somatic deletion or
transposition event. No mosaicism has been detected in flies not subjected to a heat shock at the larval stage. The
somatic instability results clearly indicate that the w+ insertions are Minos-mediated.

7. ANALYSIS OF MINOS mRNA TRANSCRIPTS 


[0102] Total RNA was isolated from the M67 strain, the construction of which is described above. The structure of
mRNA transcripts was investigated by the polymerase chain reaction (PCR) method of DNA amplification. A particularly
important aspect of this investigation was to determine the status of the 60 base pair putative intron region (discussed
above) in the mRNA transcripts. As was mentioned previously, this sequence is characterized by 5' and 3' ends which
conform to the consensus splice donor and acceptor sites, and has a version of the internal splice signal consensus
sequence 30 nucleotides upstream from the 3' end.

[0103] To determine the status of this putative intron, PCR priming sites were selected from exon sequences (ORF1
and ORF2) flanking the putative intron. The PCR product synthesized in this reaction was cloned and sequenced by
conventional methods. The sequencing experiments revealed unambiguously that the 60 base pair intron sequence
was, in fact, absent in the amplified DNA.
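By way of illustration only, the expected sizes of the amplification products from spliced and unspliced templates can be computed from the primer coordinates. The intron coordinates (752 to 811) are those given for the Minos sequence in this description; the primer positions used in the example are hypothetical, since the actual priming sites are not stated.

```python
INTRON = (752, 811)  # putative intron, inclusive coordinates from the text


def amplicon_length(fwd_start: int, rev_end: int, spliced: bool, intron=INTRON) -> int:
    """Expected product length for primers flanking the intron.

    fwd_start: coordinate of the 5' end of the forward primer
    rev_end:   coordinate of the 5' end of the reverse primer (top strand)
    """
    intron_start, intron_end = intron
    if not (fwd_start < intron_start and rev_end > intron_end):
        raise ValueError("primers must flank the intron")
    unspliced = rev_end - fwd_start + 1
    intron_length = intron_end - intron_start + 1
    return unspliced - intron_length if spliced else unspliced


if __name__ == "__main__":
    # Hypothetical primer positions in ORF1 and ORF2.
    print(amplicon_length(700, 900, spliced=False))
    print(amplicon_length(700, 900, spliced=True))
```

Whatever primer pair is chosen, a product from correctly spliced mRNA is expected to be 60 bp shorter than a product from genomic DNA or unspliced transcript.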

[0104] The removal of the 60-bp sequence in the correctly spliced primary transcript initiating upstream from ORF1
results in the generation of a 1023-bp open reading frame which encodes a peptide of 341 amino acids. An alignment
of the 273 carboxy-terminal amino acids of this peptide with the sequences of TcA and the 273-residue hypothetical
peptide of TCb1 was generated by the multiple alignment program CLUSTAL, which introduces gaps in the sequences
to achieve maximum sequence similarity. The three sequences were aligned without the need of any
insertions-deletions (with the exception of the two one-residue gaps required for optimal alignment in the ORF2 region) and show an
overall 28% identity, i.e. 76 of the 273 positions are invariant. In the region upstream from the first methionine of ORF2,
twelve out of seventy-two positions (16%) are invariant; 29 positions (40%) are occupied by structurally related amino
acid residues. Although this degree of similarity is lower than that in the ORF2 region, it is statistically significant.

[0105] The sequence similarity between TcA and the carboxy end of the Minos hypothetical protein is also reflected
in their secondary structures. Comparisons of α-helix and β-sheet predictions and hydrophobicity profiles between the
Tc1 and Minos sequence show similarities in several regions. Another feature of the sequences is their high content,
approximately 20%, in basic amino acids. TcA has 29 arginines, 16 lysines and 11 histidines, and the TcA-related
Minos sequence has 20 arginines, 32 lysines and 4 histidines. These are more abundant at the amino-terminal half of
both sequences, although the position of most is not strictly conserved. The proteins are fairly basic, with computed
isoelectric points of 11.27 for TcA and 10.73 for the related Minos peptide. The computed pI of the complete hypothetical
341 amino acid Minos protein is 10.97.
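By way of illustration only, the stated basic amino acid content can be checked from the residue counts given above. The sketch assumes that both counts refer to the 273-residue aligned sequences; the function names are hypothetical.

```python
ALIGNED_LENGTH = 273  # residues in TcA and in the TcA-related Minos sequence

# Residue counts taken from the text.
BASIC_COUNTS = {
    "TcA": {"R": 29, "K": 16, "H": 11},
    "Minos": {"R": 20, "K": 32, "H": 4},
}


def basic_fraction(counts: dict, length: int) -> float:
    """Fraction of residues that are Arg, Lys or His."""
    return sum(counts.values()) / length


def count_basic(sequence: str) -> dict:
    """Count basic residues in a one-letter protein sequence."""
    return {aa: sequence.upper().count(aa) for aa in "RKH"}


if __name__ == "__main__":
    for name, counts in BASIC_COUNTS.items():
        print(f"{name}: {100 * basic_fraction(counts, ALIGNED_LENGTH):.1f}% basic residues")
```

Both sequences contain 56 basic residues in 273 positions, about 20.5%, in line with the approximate figure of 20% given above. The helper `count_basic` could be applied to an actual protein sequence where one is available.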

8. GENE TRANSFER INTO C. capitata USING MINOS TRANSPOSABLE ELEMENTS 


[0106] Single copies of exogenous DNA can be introduced into the genome of C. capitata by using a germ line trans- 
formation system which utilizes the transposable element Minos to mediate precise integration of DNA at acceptable 
frequencies. 

[0107] To provide an effective dominant selectable marker for detection of transformants, an approximately 3.7 kb NotI
fragment containing the wild-type white cDNA of C. capitata, flanked by the D. melanogaster hsp70 promoter and
terminator sequences, was inserted into the NotI site of the Minos vector pMiNot, which was constructed by replacing a
644 bp MscI fragment of the Minos transposase gene (nucleotides 618 to 1264 of Figure 2A-2C) with a NotI linker. This
modified element (shown in Figure 3A) was inserted into the E. coli vector pTZ18R (Pharmacia), creating a plasmid
(pMihsCcw) having a wild-type copy of the C. capitata white (w) gene as a dominant visible marker.
[0108] To place the Minos transposase under heat shock control, the left-hand terminal repeat of Minos-2 was
replaced by a 456 bp fragment containing the D. melanogaster hsp70 promoter. This modified element (shown in
Figure 3B) was inserted into the E. coli vector pTZ18R (Pharmacia), creating the transposase-producing plasmid
pHSS6hsMi.

[0109] The plasmids pMihsCcw and pHSS6hsMi were introduced into pre-blastoderm Medfly w/w embryos by a
microinjection procedure similar to that used for Drosophila. For egg collecting, flies were mass-reared in population
cages at 24°C. Eggs were collected at 24°C for 60 minutes, and then were dechorionated, desiccated and microinjected
at 18°C with a mixture of 100 µg/ml helper and 400 µg/ml transposon plasmid DNA as described for Drosophila
embryos (Rubin, G.M. and Spradling, A.C., Science 218: 348 (1982)). Modifications of the procedure were not
necessary, because the eggs of the two species are similar in morphology and in resistance to desiccation.

[0110] A total of 3,998 embryos were injected. After injection, they were left to hatch under halocarbon oil, and first
instar larvae were transferred to Petri dishes containing standard larval food (Mintzas, A.C. et al., Dev. Biol. 95: 492
(1983)). The 390 adults (G0 generation) resulting from injected embryos were collected within 12 hours after eclosion
and back-crossed to w flies in small groups consisting of either 5 G0 males and 10 virgin w females, or 10 G0 females
and 5 w males. Fifty-nine such G0 groups were reared in small plastic cages and the G1 progeny were collected and
handled separately for each group. To induce expression of the w mini-gene from the Hsp70 promoter, G1 pupae were
exposed daily to a 39°C heat shock for one hour. The 62,510 G1 flies that were produced were screened for the
presence of non-white eye phenotypes. As shown in Figure 4, a total of 72 flies with colored eyes were recovered from four
different cages.

[0111] The w mini-gene gives partial reversion of the phenotype. Eye color varies in strength among different
transformants. The phenotype is dosage-dependent, with homozygotes having stronger colors than heterozygotes. These
characteristics of w markers are useful in sorting multiple insertions and in distinguishing homozygous from
heterozygous transformants. The characteristics are due to low levels of expression combined with chromosomal position
effects and have been observed previously in Drosophila.

[0112] To establish transformed lines, individual G1s were initially back-crossed to w flies. Single pairs of transformed
G2 progeny were then mated, and their homozygous G3 progeny, recognized by their stronger phenotypes, were
used to construct homozygous lines. Table 3 shows the results from the G1 back-crosses. In these crosses, the
non-white eye (w+) phenotype was inherited as a single, dominant trait.

[0113] To determine the effect of temperature on the expression of the w mini-gene, a number of G2 pupae were not
subjected to the heat shock treatment. When compared to the heat-shocked cohort, G2 flies which had not been heat
shocked as pupae showed either paler eye color or no eye color at all; the only exceptions were lines 3.1 and 3.3, which
exhibited an invariant strong yellow eye phenotype. The heat shock dependence clearly showed that the flies (perhaps
with the exception of 3.1 and 3.3) were true transformants, rather than revertants of the w mutation. In cages 3 and 25,
differences in the eye color phenotypes of individual G1s from the same cage were detected and bred true, suggesting
that independent transformation events had occurred in the same cage.



TABLE 3

                                With heat shock         Without heat shock
        Eye color of            non-white   white       non-white   white       Eye color of
G1      heterozygotes           eyes        eyes        eyes        eyes        homozygotes

1.1     pale yellow             46          53          0           59          apricot
1.8     pale yellow             220         274         0           77          apricot
1.12    pale yellow             94          69          0           8           apricot
3.1     yellow                  267         237         110         97          yellow
3.3     yellow                  225         214         53          49          yellow
3.2     pale yellow             132         118         0           76          apricot
3.6     pale yellow             70          81          0           81          apricot
25.7    pale apricot            119         156         116*        91          apricot
25.8    pink                    24          18          0           27          peach
25.9    pink                    30          34          0           9           peach
33.2    pale orange             42          50          ND          ND          orange
33.3    pale orange             29          31          ND          ND          orange
33.4    pale orange             16          15          ND          ND          orange

* Eye color much weaker than with heat shock.
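
A back-cross of a heterozygote carrying a single dominant insertion to w flies is expected to give non-white and white progeny in a 1:1 ratio. The following is a minimal sketch of how the heat-shock counts in Table 3 could be compared with that expectation using a chi-square statistic. No such test is reported in the text; the function name and the choice of a 0.05 significance level are assumptions made for illustration.

```python
# Critical value of the chi-square distribution, 1 degree of freedom, alpha = 0.05.
CHI2_CRITICAL_DF1 = 3.841

# (non-white, white) G2 counts with heat shock, copied from Table 3.
HEAT_SHOCK_COUNTS = {
    "1.1": (46, 53), "1.8": (220, 274), "1.12": (94, 69),
    "3.1": (267, 237), "3.3": (225, 214), "3.2": (132, 118),
    "3.6": (70, 81), "25.7": (119, 156), "25.8": (24, 18),
    "25.9": (30, 34), "33.2": (42, 50), "33.3": (29, 31),
    "33.4": (16, 15),
}


def chi_square_1_to_1(non_white: int, white: int) -> float:
    """Chi-square statistic for observed counts against a 1:1 expectation."""
    total = non_white + white
    if total == 0:
        raise ValueError("no progeny scored")
    expected = total / 2
    return ((non_white - expected) ** 2 + (white - expected) ** 2) / expected


if __name__ == "__main__":
    for line, (non_white, white) in HEAT_SHOCK_COUNTS.items():
        stat = chi_square_1_to_1(non_white, white)
        verdict = "exceeds" if stat > CHI2_CRITICAL_DF1 else "within"
        print(f"line {line}: chi-square = {stat:.3f} ({verdict} the 0.05 threshold)")
```

The statistic only measures departure from 1:1; factors such as incomplete penetrance of pale phenotypes would also shift the counts and are not modeled here.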



[0114] To determine the nature of the integration events, DNA from transformants was analyzed by Southern blot
hybridizations using several restriction enzymes and two probes (see Figure 3A), one (M) containing the Minos
sequences at the ends of the transposon (which are not present in non-transformed Medfly), and another (W)
containing an internal fragment of the w cDNA sequences (which is present in the endogenous w gene).

[0115] Adult genomic DNA (approximately 10 µg per lane) was digested with a restriction endonuclease, subjected
to agarose gel electrophoresis, blotted onto nitrocellulose membrane filters and hybridized with 32P-labeled probes.
Membranes were pre-hybridized for 6 hours at 65°C in 7% SDS, 0.5 M phosphate buffer pH 7.4, 1 mM EDTA.
Hybridization was for 12-14 hours at 65°C in 7% SDS, 0.5 M phosphate buffer pH 7.4, 1 mM EDTA. Excess probe was removed
by two 10-minute washes with 5% SDS, 40 mM phosphate buffer pH 7.4, 1 mM EDTA at 65°C followed by a 20-minute
wash at room temperature with the same buffer pre-warmed at 65°C.

[0116] DNA from lines 3.1, 3.2, 3.3 and 3.6 was cut with SalI and hybridized with a 1 kb HhaI fragment containing
Minos sequences present in pMiNot (M probe of Figure 3A).

[0117] DNA from the recipient w strain and from lines 3.1, 3.2, 3.3 and 3.6 was cut with HincII and probed with a
SalI/XhoI fragment containing 1.5 kb of Medfly w cDNA sequences (W probe of Figure 3A) and with the M probe.
Between the two hybridizations the filter was dehybridized by washing with boiling 0.5% SDS solution for 2 minutes.

[0118] In Drosophila, insertions of elements like Minos can occur at many different chromosomal sites, and are
characterized by precise integration extending through the terminal inverted repeats of the element without transposition of
any flanking plasmid DNA. The results of M-hybridized SalI digests document that the events in the Medfly are of the
same nature. The transposon has inserted at variable host DNA sites, and no significant (> 0.2 kb) flanking plasmid DNA
to the right of the transposon can be present, because this would have been signaled by the presence of a 2.9 kb band.
The results also confirm that two independent events have occurred in cage 3, one represented by lines 3.1 and 3.3
and the other by lines 3.2 and 3.6 (cf. Table 3). These conclusions were also confirmed with HincII digests. Similarly,
blots of HincII digests hybridized with the W probe showed the two endogenous w gene bands, plus a third novel band
that is characteristic of the insertion event (3.1/3.3 or 3.2/3.6). The shortest band is longer than the 1.9 kb band that
would have been expected if the HincII site 0.2 kb to the right of the Minos end (see Figure 3A) had been present. The
same HincII blot hybridized with the M probe showed that the shortest band is longer than the 1.1 kb band that would
have been expected if plasmid sequences to the left of the transposon were present. These results were confirmed with
W-hybridized SalI digests.

[0119] To assess the integrity of the internal part of the transposon, restriction analysis using EcoRI was performed
in three lines derived from cage 25. DNA from strains 25.7, 25.8 and 25.9 was cut with EcoRI and hybridized with the
W and M probes sequentially. In addition to the transformants showing non-white eye phenotypes, white-eyed siblings
(25.9-w, 25.8-w, 25.7-w) were included in this analysis. The results of the hybridization with the W probe indicate that
the entire 3.7 kb fragment containing the Hsp70/w marker fusion is present in the w+ transformants. Hybridization of
the same filter with the M probe, which detects "chimeric" end fragments, showed that lines 25.8 and 25.9 contain the
same, single insertion of the transposon. The pattern in 25.7 is consistent with the presence of two insertions, neither
identical to the 25.8/25.9 event. One of these insertions, defined by the ~3 kb and ~5.5 kb bands, is also present in the
white-eyed siblings of the 25.7 flies. This, presumably, represents a "silent" insertion that does not express the
phenotype, either because of an undetected lesion in the transposon or because the transposon has integrated into a silent
(perhaps heterochromatic) genomic region.

[0120] Restriction analysis of the transformants revealed that, as predicted by the phenotypes (Table 3), two
independent transformants were represented among the G1 progeny of cage 3, two in cage 25, and one in cage 33. (Data
for transformants from cage 33 are not shown. The restriction patterns of three G1s from cage 1 were identical to those
of the 3.2/3.6 event. Evidently, a G0 male present in cage 3 had mated with a G0 female of cage 1 before the G0 flies
were sorted into cages.) Only one of these 5 transformants (25.7) had a second (phenotypically silent) event in the
same germ line. The different transformants from the same cages are derived either from single or multiple G0 parents.
The overall frequency of phenotypically detectable transformation events (5/390 G0 adults) is sufficient for producing
several transformants from a single experiment, since thousands of embryos can be injected and hundreds of G0 adults
can be obtained within a week using a relatively simple experimental setup.
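
The yields quoted for this experiment can be put on a common scale with simple ratios. The sketch below uses only the counts given in the text (3,998 injected embryos, 390 G0 adults, 62,510 screened G1 flies, 72 colored-eye G1 flies, 5 independent events); the variable and function names are assumptions chosen for illustration.

```python
# Counts taken from paragraphs [0110] and [0120].
EMBRYOS_INJECTED = 3998
G0_ADULTS = 390
G1_SCREENED = 62510
G1_COLORED = 72
INDEPENDENT_EVENTS = 5


def fraction(part: int, whole: int) -> float:
    """Return part/whole, refusing a zero denominator."""
    if whole <= 0:
        raise ValueError("denominator must be positive")
    return part / whole


survival_to_adult = fraction(G0_ADULTS, EMBRYOS_INJECTED)   # injected embryos reaching adulthood
events_per_g0 = fraction(INDEPENDENT_EVENTS, G0_ADULTS)     # detectable events per G0 adult
colored_among_g1 = fraction(G1_COLORED, G1_SCREENED)        # colored-eye flies among screened G1

if __name__ == "__main__":
    print(f"survival to adult:      {survival_to_adult:.1%}")
    print(f"events per G0 adult:    {events_per_g0:.2%}")
    print(f"colored-eye G1 fraction: {colored_among_g1:.3%}")
```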

[0121] To confirm the presence of a single Minos insertion in transformant 3.1, third instar larva salivary gland
polytene chromosomes were prepared and in situ hybridizations were performed essentially as described previously
(Zacharopoulou, A., et al., Chromosoma 101: 448 (1992)). The 3.7 kb NotI fragment containing the Hsp70/w minigene fusion
was used as probe. Hybridization to polytene chromosomes of salivary glands from transformed third instar larvae
confirmed the presence of single Minos insertions, allowing their cytological localization.

EXAMPLE 2 

MATERIALS AND METHODS 

1. TRANSFECTION OF MAMMALIAN CELLS 

[0122] Human HeLa cells and green monkey COS1 cells were cultured at 37°C in an atmosphere containing 4% CO2
in DMEM supplemented with 10% fetal calf serum (FCS) and 50 µg/ml gentamycin. HeLa cells were seeded onto 60
mm dishes (300,000 cells per dish) and COS1 cells were seeded onto 6-well plates (200,000 cells per well) one day
prior to transfection.

2. HELA CELL TRANSFECTIONS

[0123] HeLa cells were transfected with Qiagen (Qiagen) and Elutip (Schleicher and Schuell)-purified supercoiled
plasmid DNA in 2.5 ml DMEM supplemented with 2% FCS and 50 µg/ml gentamycin and 0.5 ml of a calcium/HBS
precipitant, according to the calcium chloride procedure described by Sambrook et al. (Molecular Cloning: A Laboratory
Manual, Cold Spring Harbor Laboratory Press, New York (1989)).

[0124] HeLa cells were transfected with either 8 µg of transposon plasmid pMiLRneo (Figure 6) alone; a mixture of 8
µg of the transposon plasmid pMiLRneo (Figure 6) and 2 µg of the helper plasmid pEF1/ILMi (Figure 5); or a mixture of
8 µg of the "wings clipped" plasmid pMiLneo (Figure 7) and 2 µg of the helper plasmid pEF1/ILMi (Figure 5). The "wings
clipped" plasmid includes a modified Minos transposon in which one of the inverted repeats has been excised.

[0125] The pMiLRneo transposon plasmid and the "wings clipped" plasmid include a selectable neomycin (neo)
resistance gene, which is under the control of (e.g., operably linked to) the SV40 early promoter. The neo gene encodes a
prokaryotic aminoglycoside phosphotransferase that detoxifies the antibiotic G418, which blocks protein synthesis in
eukaryotic and prokaryotic cells, thereby allowing for selection and growth of colonies containing inserts (transfected
cells).

[0126] After 16 hours of incubation with DNA, the cells were washed twice with serum-free DMEM and re-fed with 4
ml of serum-containing (10% FCS) DMEM. Two days post-transfection, the cells were trypsinized and seeded onto 150
mm (Experiment #1, Table 4) or 90 mm (Experiments #2 and #3, Table 4) dishes with DMEM-10% FCS containing 600
µg/ml G418 (Gibco-BRL). After 15 days of selection, cell clones were either isolated and expanded into individual
cultures, or fixed in a solution containing 10% (v/v) formaldehyde in PBS for 15 minutes. Fixed cells were stained with 2%
(w/v) methylene blue in PBS and colonies counted.

3. COS CELL TRANSFECTIONS 

[0127] COS1 cells were prepared for transfection. Transfection selection was performed as described for the HeLa
cells, except that COS cells were transfected with a mixture of 3 µg supercoiled pEF1/ILMi helper plasmid DNA (Figure
5) and 3 µg supercoiled pQB125 (Quantum) plasmid DNA. Colonies were fixed in 10% formaldehyde in PBS.
Expression and cellular localization of the transposase encoded by the plasmid DNA were determined by indirect
immunocytochemical techniques using a polyclonal antiserum against Minos transposase and a rhodamine-conjugated goat
anti-rabbit antibody.

RESULTS 

1. TRANSPOSASE-INDUCED STABLE TRANSFECTION OF HeLa CELLS WITH A MINOS TRANSPOSON

[0128] As shown in Table 4, transfection of HeLa cells with transposon plasmid pMiLRneo in the presence of the helper
plasmid pEF1/ILMi, which encodes a transposase, resulted in a 19-fold (Experiment #1), an 18-fold (Experiment #2) and
a 14.5-fold (Experiment #3) increase in the rate of recovery of stable transfectants, compared to transfection with the
transposon plasmid pMiLRneo alone.
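
The fold increases quoted in this paragraph follow from the colony counts in Table 4. The sketch below recomputes them; the dictionary layout and function name are assumptions for illustration, and the Experiment #1 helper count is entered as 15,000 although Table 4 gives it only as an approximate value.

```python
# Colony counts from Table 4: (transposon alone, transposon + helper).
# The helper count for Experiment #1 is approximate in the table.
COLONIES = {
    1: (800, 15000),
    2: (250, 4500),
    3: (200, 2900),
}


def fold_increase(transposon_alone: int, with_helper: int) -> float:
    """Ratio of colonies obtained with helper to colonies obtained without it."""
    if transposon_alone <= 0:
        raise ValueError("baseline colony count must be positive")
    return with_helper / transposon_alone


fold_by_experiment = {
    exp: fold_increase(alone, helper) for exp, (alone, helper) in COLONIES.items()
}

if __name__ == "__main__":
    for exp, fold in fold_by_experiment.items():
        print(f"Experiment #{exp}: {fold:.2f}-fold")
```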

[0129] The transfection depends upon the presence of two functional Minos inverted repeats in the transposon
plasmid, since transfection in the presence of the "wings clipped" transposon and the helper plasmid results in numbers of
colonies roughly equivalent to the number of background colonies produced by transfection with transposon plasmid
alone.

[0130] Moreover, the percentage of HeLa cells transfected by the Minos-based system was surprisingly high (e.g.,
2.5% in Experiment #1) compared to previously described methods used to transfect mammalian cells. Existing
transfection methods generally result in very low percentages of stably transfected cells. In the present experiments,
between 10^-4 and 10^-3 stably transfected cells were obtained by using the calcium co-precipitation method as described
in Ausubel et al., supra (see Chapters 9.1.11 and 9.5.1 (1998)). For example, in Experiment #1, the stable transfection
efficiency was 25- to 250-fold higher than that obtained by conventional methods. Thus, the present invention is a new
and improved method for transfecting cells, including mammalian cells.

TABLE 4

                Transposon          Transposon+Helper       "wings clipped" + Helper
EXPERIMENT      A        B          A           B           A        B

#1              800      0.13       ~15,000     2.5
#2              250      0.041      4,500       0.75
#3              200      0.033      2,900       0.48        300      0.020

A = Number of Transfected Colonies
B = Percent of Cells Transfected
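
The 25- to 250-fold comparison in paragraph [0130] can be checked directly from the numbers given there: a stable transfection fraction of 2.5% in Experiment #1 against a conventional range of 10^-4 to 10^-3. The following is a small sketch; the constant and function names are assumptions for illustration.

```python
# Values quoted in paragraph [0130].
MINOS_FRACTION = 0.025          # 2.5% of cells, Experiment #1
CONVENTIONAL_LOW = 1e-4         # lower bound for calcium co-precipitation
CONVENTIONAL_HIGH = 1e-3        # upper bound for calcium co-precipitation


def improvement_range(new: float, old_low: float, old_high: float) -> tuple:
    """Return (smallest, largest) fold improvement of `new` over the old range."""
    if not 0 < old_low <= old_high:
        raise ValueError("old range must be positive and ordered")
    return new / old_high, new / old_low


fold_min, fold_max = improvement_range(MINOS_FRACTION, CONVENTIONAL_LOW, CONVENTIONAL_HIGH)

if __name__ == "__main__":
    print(f"{fold_min:.0f}- to {fold_max:.0f}-fold higher")
```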



[0131] These data show that co-transfection of a human cell line with a plasmid carrying a Minos transposon and the
Minos transposase gene results in integration of a nucleic acid sequence.

[0132] These data show that both transposase and the presence of two Minos ends are necessary for integration into
the genome of cells (e.g., mammalian cells). Therefore, the data show that the integration effect of the Minos
transposase is a result of its specific enzymatic function.

[0133] For example, in insect cells, transposases of Type II mobile elements, like the Minos transposase, function by
binding at or near the inverted repeats of the transposon and catalyzing the precise excision of the entire transposon
(i.e., the DNA flanked by and including the inverted repeats) from its position and precise re-insertion into DNA. Like
other known elements belonging to the same family of transposons as Minos, insertion of the Minos transposon into
DNA is not entirely random. The element inserts at a TA dinucleotide via a mechanism that causes duplication of the
target TA. In this way, transposase-mediated integrations of Minos can be characterized by the presence of intact
inverted repeats flanked by TA dinucleotides. Consequently, the molecular basis of the Minos transposon insertions in
the stably transfected HeLa cells can be determined by Southern blot analysis of the DNA from G418 resistant colonies,
and can be confirmed by cloning and sequencing of individual insertions from these sublines.
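
The TA target-site duplication described above can be pictured with a simple string model. This is a sketch under stated assumptions, not a description of the enzymatic mechanism: DNA is treated as a single-strand string, the element is represented by the placeholder `[Minos]`, and the function names are hypothetical.

```python
def ta_sites(target: str) -> list:
    """Return the start index of every TA dinucleotide in the target."""
    seq = target.upper()
    return [i for i in range(len(seq) - 1) if seq[i:i + 2] == "TA"]


def insert_at_ta(target: str, site: int, element: str) -> str:
    """Insert `element` at the TA starting at `site`, duplicating the TA.

    The result reads ...TA + element + TA..., so the element ends up
    flanked by one copy of the target dinucleotide on each side.
    """
    seq = target.upper()
    if seq[site:site + 2] != "TA":
        raise ValueError("insertion site is not a TA dinucleotide")
    return seq[:site + 2] + element + seq[site:]


def flanked_by_ta(genome: str, start: int, length: int) -> bool:
    """Check that the element occupying genome[start:start+length] has TA on both sides."""
    if start < 2:
        return False
    left = genome[start - 2:start]
    right = genome[start + length:start + length + 2]
    return left == "TA" and right == "TA"


# Toy target with a single TA; the element is a placeholder string.
target = "GGCTAGCC"
element = "[Minos]"
site = ta_sites(target)[0]
integrated = insert_at_ta(target, site, element)
```

In this toy case the flanking check succeeds on `integrated` at the position where the placeholder begins, which mirrors the sequencing signature the text describes for individual insertions.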

2. EXPRESSION AND NUCLEAR LOCALIZATION OF MINOS TRANSPOSASE IN TRANSIENTLY TRANSFECTED
COS1 CELLS

[0134] Minos transposase was localized in the nuclei of the cells, documenting expression of the Minos transposase
and transport into the nuclei. Nuclear localization of Minos transposase is consistent with the function of Minos as a
transposase and the presence of several nuclear localization signals consisting of stretches of amino acid residues with
basic side chains in its primary amino acid sequence.

3. DETERMINATION OF INTEGRATION EFFICIENCY 

[0135] To determine the efficiency of integration of the Minos transposon into chromosomes after transfection, HeLa
cells were co-transfected with 1 µg of the GFP-expressing plasmid pIRES/GFP (Clontech) and either (a) a mixture of 8
µg of the transposon plasmid pMiLRneo (Figure 6) and 2 µg of the helper plasmid pEF1/ILMi (Figure 5); (b) a mixture
of 8 µg of the transposon plasmid pMiLRneo (Figure 6) and 2 µg of the plasmid pEF1; (c) a mixture of 8 µg of pMiLRneo
(lin.) and 2 µg of the helper plasmid pEF1/ILMi (Figure 5); (d) a mixture of 8 µg of pMiLRneo (lin.) and 2 µg of the
plasmid pEF1; (e) a mixture of 8 µg of pMiLRneo (dig.) and 2 µg of the helper plasmid pEF1/ILMi (Figure 5); (f) a mixture of
8 µg of pMiLRneo (dig.) and 2 µg of the plasmid pEF1; or (g) a mixture of 8 µg of the "wings clipped" transposon plasmid
pMiLneo (Figure 7) and 2 µg of the helper plasmid pEF1/ILMi (Figure 5). Plasmid pEF1 does not contain the Minos
transposase gene. The GFP-expressing plasmid contains the green fluorescent protein gene under the control of the
CMV promoter (e.g., operably linked), but does not contain the inverted repeats of the Minos transposable element.
Several controls were included (see Table 5): pMiLRneo(lin.) is the transposon plasmid pMiLRneo (Figure 6) linearized
with the restriction enzyme SacI, cleaving the plasmid once outside the transposon; pMiLRneo(dig.) is the transposon
plasmid pMiLRneo (Figure 6) digested with a combination of SacI and KpnI, cleaving the plasmid to the right and to the
left of the transposon; and plasmid pMiLneo (Figure 7) is the "wings clipped" transposon described above.

[0136] In each experiment, 2.5 x 10^6 HeLa cells were used. Co-transfection with the GFP-expressing plasmid enabled
an estimate to be made of the minimum fraction of cells in which the uptake of DNA had occurred. The results are
shown in Table 5.

TABLE 5

                                 % of cells
Transfection Mixture             expressing GFP     % of cells neoR     Number of neoR colonies

pMiLRneo + pEF1                  1.55               0.250               6,240
pMiLRneo + pEF1/ILMi             1.95               4.118               102,960
pMiLRneo(lin.) + pEF1            0.45               0.005               130
pMiLRneo(lin.) + pEF1/ILMi       0.63               0.066               1,640
pMiLRneo(dig.) + pEF1            0.46               0.005               125
pMiLRneo(dig.) + pEF1/ILMi       0.75               0.035               880
pMiLneo + pEF1/ILMi              1.60               0.082               2,040
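
The percentage and colony-count columns of Table 5 appear to be related through the 2.5 x 10^6 cells used per experiment. The sketch below recomputes the percentages from the colony counts under that assumption, which is an inference from paragraph [0136] rather than something the text states explicitly; the names are hypothetical.

```python
# Cells per experiment, from paragraph [0136].
CELLS_PER_EXPERIMENT = 2_500_000

# Number of neo-resistant colonies per transfection mixture, from Table 5.
NEO_R_COLONIES = {
    "pMiLRneo + pEF1": 6240,
    "pMiLRneo + pEF1/ILMi": 102960,
    "pMiLRneo(lin.) + pEF1": 130,
    "pMiLRneo(lin.) + pEF1/ILMi": 1640,
    "pMiLRneo(dig.) + pEF1": 125,
    "pMiLRneo(dig.) + pEF1/ILMi": 880,
    "pMiLneo + pEF1/ILMi": 2040,
}


def percent_of_cells(colonies: int, cells: int = CELLS_PER_EXPERIMENT) -> float:
    """Colonies expressed as a percentage of the cells used in the experiment."""
    if cells <= 0:
        raise ValueError("cell number must be positive")
    return 100.0 * colonies / cells


percent_neo_r = {mix: percent_of_cells(n) for mix, n in NEO_R_COLONIES.items()}

if __name__ == "__main__":
    for mix, pct in percent_neo_r.items():
        print(f"{mix:30s} {pct:.3f}%")
```

Rounded to three decimals, the recomputed values agree with the percentage column of the table, which supports reading the colony counts as totals per 2.5 x 10^6 cells.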



[0137] These results show that the enhanced integration rates observed in the presence of transposase are not the 
result of differential transfection efficiencies between different transfection experiments. 

[0138] These results show that the highest efficiency of integration was observed in cells which had been transfected
with the intact (circular) transposon (pMiLRneo) and the helper plasmid pEF1/ILMi, which encodes a Minos
transposase. The results show that in these transfected cells, the transposon integrated stably in a large fraction of the cells
in which uptake of DNA had occurred, which was determined by GFP detection. The fraction of cells transiently
expressing GFP (1.95% maximum) likely represents an underestimate of the cells in which uptake of DNA had
occurred, presumably due to the low sensitivity of GFP detection under the conditions of these experiments. HeLa
transfection frequencies of 10% are considered in the art to be "excellent". Taken together with these observations, the
results suggest that "true" rates of integration of Minos with helper, i.e., the percent of transfected cells in which at least
one copy of DNA has integrated into the genome of the cells, may be higher than 50%.
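
One way to see why GFP expression is treated as an underestimate of DNA uptake is to divide the percentage of neo-resistant cells by the percentage of GFP-expressing cells for each mixture in Table 5. This ratio is not computed in the text; the sketch below is an illustrative calculation with hypothetical names.

```python
# (% of cells expressing GFP, % of cells neo-resistant), from Table 5.
TABLE_5 = {
    "pMiLRneo + pEF1": (1.55, 0.250),
    "pMiLRneo + pEF1/ILMi": (1.95, 4.118),
    "pMiLRneo(lin.) + pEF1": (0.45, 0.005),
    "pMiLRneo(lin.) + pEF1/ILMi": (0.63, 0.066),
    "pMiLRneo(dig.) + pEF1": (0.46, 0.005),
    "pMiLRneo(dig.) + pEF1/ILMi": (0.75, 0.035),
    "pMiLneo + pEF1/ILMi": (1.60, 0.082),
}


def neo_per_gfp(gfp_percent: float, neo_percent: float) -> float:
    """Stable neo-resistant cells per transiently GFP-positive cell."""
    if gfp_percent <= 0:
        raise ValueError("GFP percentage must be positive")
    return neo_percent / gfp_percent


ratios = {mix: neo_per_gfp(gfp, neo) for mix, (gfp, neo) in TABLE_5.items()}

if __name__ == "__main__":
    for mix, ratio in ratios.items():
        print(f"{mix:30s} {ratio:.3f}")
```

For the circular transposon with helper the ratio is above 1, meaning more cells became stably resistant than were scored as GFP-positive, which is only possible if GFP detection missed part of the transfected population.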

4. EXON TRAPPING IN HELA CELLS

[0139] HeLa cells were co-transfected with 1 µg of the GFP-expressing plasmid pIRES/GFP (Clontech) and either (a)
a mixture of 8 µg of the transposon plasmid pMiLRβgeo and 2 µg of the helper plasmid pEF1/ILMi (Figure 5); (b) a
mixture of 8 µg of the transposon plasmid pMiLRβgeo and 2 µg of the plasmid pEF1; or (c) a mixture of 8 µg of pMiLRβgeo
(lin.) and 2 µg of the plasmid pEF1.

[0140] The transposon plasmid pMiLRβgeo, consisting of the βgeo cassette flanked by the Minos inverted repeats,
was derived from the transposon plasmid pMiLRneo (Figure 6). Specifically, the SV40neo cassette was replaced by the
βgeo cassette (Friedrich and Soriano, Genes Dev., 5:1513-1523 (1991)) (HindIII and EcoRI cloning sites) to produce
the transposon plasmid pMiLRβgeo. The βgeo cassette contains the first intron (1690 bp), the splice acceptor site and
a portion of the second exon (183 bp) of the En-2 gene (Skarnes et al., Genes Dev., 5:903-918 (1992)); the exon is fused
in-frame to an in-frame fusion consisting of the E. coli LacZ gene and the neomycin resistance gene, and the
termination of transcription is controlled by the SV40 terminator (240 bp). The βgeo cassette is functional as an exon trap and
is described in Friedrich and Soriano, Genes Dev., 5:1513-1523 (1991). The neo resistance of plasmid pMiLRβgeo can
only be expressed if it is integrated into an intron of an active gene, in the appropriate orientation, so that splicing
produces a novel fusion protein with the neo and β-galactosidase modules at the carboxy terminus.
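
The condition for neo expression stated at the end of paragraph [0140] can be written as a simple predicate. The sketch below is a deliberately coarse boolean model with hypothetical names (`Insertion`, `expresses_neo`); it ignores reading frame, expression level and every other biological detail, and is meant only to make the three stated requirements explicit.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Insertion:
    """Hypothetical description of one integration of the exon-trap transposon."""
    in_intron: bool          # integrated within an intron
    gene_active: bool        # the host gene is transcribed in these cells
    same_orientation: bool   # cassette oriented like the host gene


def expresses_neo(ins: Insertion) -> bool:
    """neo resistance requires all three conditions named in the text."""
    return ins.in_intron and ins.gene_active and ins.same_orientation


examples = [
    Insertion(in_intron=True, gene_active=True, same_orientation=True),
    Insertion(in_intron=True, gene_active=True, same_orientation=False),
    Insertion(in_intron=False, gene_active=True, same_orientation=True),
    Insertion(in_intron=True, gene_active=False, same_orientation=True),
]
resistant = [expresses_neo(ins) for ins in examples]
```

Because every condition must hold at once, only a small subset of integrations is expected to score as resistant, which is the basis for using the cassette as an exon trap.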

[0141] pMiLRβgeo (lin.) is the transposon plasmid pMiLRβgeo linearized with the restriction enzyme HindIII, cleaving
the plasmid once within the transposon at the 5' end of the En-2 intron.

[0142] In each experiment, 2.5 x 10^6 HeLa cells were used. Co-transfection with the GFP-expressing plasmid enabled
an estimate to be made of the minimum fraction of cells in which the uptake of DNA had occurred. The results are
shown in Table 6.



TABLE 6

                                 % of cells
Transfection Mixture             expressing GFP     % of cells neoR     Number of neoR colonies

pMiLRβgeo + pEF1                 2.15               0.002               62
pMiLRβgeo + pEF1/ILMi            1.70               0.028               696
pMiLRβgeo(lin.) + pEF1           0.58               0.000               2



[0143] In an exon trap experiment, only a small fraction of integrations, namely those in introns and in the correct
orientation, are expected to express neo activity. The results show that, as expected, the frequency of neo resistant
colonies was many-fold (150X) lower compared to that in the random integration experiment described in the previous
section (section 3).

[0144] Twenty-two neo-resistant colonies which had been transfected with the intact (circular) transposon
(pMiLRβgeo) and the helper plasmid pEF1/ILMi, which encodes a Minos transposase, were grown and analyzed
histochemically for β-galactosidase expression. Of those colonies, 16 showed detectable histochemical staining.
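
The approximately 150-fold reduction and the proportion of stained colonies can both be recomputed from numbers given in the text and tables. This is a minimal sketch; the constant names are assumptions for illustration.

```python
# neo-resistant colonies obtained with circular transposon plus helper.
RANDOM_INTEGRATION_COLONIES = 102960   # pMiLRneo + pEF1/ILMi, Table 5
EXON_TRAP_COLONIES = 696               # exon-trap transposon + pEF1/ILMi, Table 6

# Histochemical analysis of exon-trap colonies, paragraph [0144].
COLONIES_ANALYZED = 22
COLONIES_STAINED = 16

fold_lower = RANDOM_INTEGRATION_COLONIES / EXON_TRAP_COLONIES
stained_fraction = COLONIES_STAINED / COLONIES_ANALYZED

if __name__ == "__main__":
    print(f"exon-trap colonies are about {fold_lower:.0f}-fold fewer")
    print(f"{stained_fraction:.0%} of analyzed colonies stained")
```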

5. MOLECULAR ANALYSIS OF TRANSPOSON INTEGRATIONS PRODUCED IN THE PRESENCE OF
TRANSPOSASE

[0145] To determine the nature of the integration events, DNA from 11 neo resistant colonies transfected with the
intact (circular) transposon (pMiLRneo) and the helper plasmid pEF1/ILMi (from the random integration experiment
described in section 3), was analyzed by Southern blot hybridizations using several restriction enzymes and the
SV40-neo cassette (see Figure 6) as a probe.

[0146] Two restriction enzyme combinations were used. The first contained BglII and XhoI, neither of which cut within
the transposon plasmid DNA. This combination is expected to generate a single band from each integration of the
transposon at different sites of the genome. The second contained the enzyme BglI, in addition to BglII and XhoI. BglI
cleaves the transposon plasmid at three positions, one within the transposon sequence and the other two in the plasmid
sequences flanking the transposon, about 1.5 kb and 0.25 kb from the left and the right end of the transposon,
respectively. The respective distances from the internal BglI are 2.15 kb and 1.75 kb, and these are the fragment sizes that
would be detected in the Southern blots if the sequence inserted contained the original plasmid sequences that flank
the Minos inverted repeats.

[0147] Restriction analysis of the transformants revealed that with BglII and XhoI, 6 of the colonies showed 1 band
each, 3 colonies had 2 bands each, one had 3 bands and one colony had 5 bands. With the three enzymes, the number
of bands in each of the colonies was doubled, and in all cases the sum of the fragment lengths was equal to or smaller
than the lengths of the fragments of the double digest. These results, combined together, strongly indicate that the
transposon has integrated into the genome of the lines (colonies) examined in a transposase-dependent manner, i.e., without
carrying any plasmid sequences.
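
The band counts reported for the 11 colonies can be summarized with a short calculation. The sketch assumes, as the text expects for the BglII/XhoI digest, that each band corresponds to one integration, and that the triple digest doubles every count; the variable names are hypothetical.

```python
# Bands per colony in the BglII/XhoI digest, from paragraph [0147]:
# 6 colonies with 1 band, 3 with 2 bands, 1 with 3 bands, 1 with 5 bands.
bands_double_digest = [1] * 6 + [2] * 3 + [3] + [5]

# With BglI added, the text reports that the band number doubled in each colony.
bands_triple_digest = [2 * n for n in bands_double_digest]

n_colonies = len(bands_double_digest)
total_integrations = sum(bands_double_digest)   # one band per integration (assumption)
mean_integrations = total_integrations / n_colonies
single_copy_colonies = bands_double_digest.count(1)

if __name__ == "__main__":
    print(f"{n_colonies} colonies, {total_integrations} integrations in total")
    print(f"mean {mean_integrations:.2f} integrations per colony")
    print(f"{single_copy_colonies} colonies carry a single integration")
```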

[0148] Southern analysis was also performed on 8 neo resistant colonies transfected with the intact (circular)
transposon (pMiLRβgeo) and the helper plasmid pEF1/ILMi (from the exon trap experiment described in section 4). In these
experiments a combination of two enzymes was used (BglII and XhoI). Two fragments are expected from each
independent single insertion of the transposon, because BglII cleaves within the pMiLRβgeo transposon. The results from
this restriction analysis revealed that there are between two and seven insertions per line, with an average number of
3.6 insertions per line.

[0149] Those skilled in the art will recognize or be able to ascertain, using no more than routine experimentation,
many equivalents to the specific embodiments of the invention described herein. Such equivalents are intended to be
encompassed by the following claims.

SEQUENCE LISTING

(1) GENERAL INFORMATION:

(i) APPLICANT/INVENTOR:
(A) NAME: Institute For Molecular Biology and Biotechnology/FORTH
(B) STREET: Box 1527
(C) CITY: Heraklion
(D) STATE/PROVINCE: Crete
(E) COUNTRY: Greece
(F) POSTAL CODE/ZIP: 711 10

(ii) TITLE OF INVENTION: Eukaryotic Transposable Element

(iii) NUMBER OF SEQUENCES: 12

(iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: Hamilton, Brook, Smith & Reynolds, P.C.
(B) STREET: Two Militia Drive
(C) CITY: Lexington
(D) STATE: Massachusetts
(E) COUNTRY: USA
(F) ZIP: 02421

(v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk
(B) COMPUTER: IBM PC compatible
(C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: PatentIn Release #1.0, Version #1.30

(vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER:
(B) FILING DATE:
(C) CLASSIFICATION:

(vii) PRIOR APPLICATION DATA:
(A) APPLICATION NUMBER: US 09/067,755
(B) FILING DATE: 27-APR-1998
(C) CLASSIFICATION:

(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: Carroll, Alice O.
(B) REGISTRATION NUMBER: 33,542
(C) REFERENCE/DOCKET NUMBER: IMBB92-01ZA3 EPO

(ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: (781) 861-6240
(B) TELEFAX: (781) 861-9540

(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1775 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

ACGAGCCCCA ACCACTATTA ATTCGAACAG CATGTTTTTT TTGCAGTGCG CAATGTTTAA 60

CACACTATAT TATCAATACT ACTAAAGATA ACACATACCA ATGCATTTCG TCTCAAAGAG 120

AATTTTATTC TCTTCACGAC GAAAAAAAAA GTTTTGCTCT ATTTCCAACA ACAACAAAAA 180

TATGAGTAAT TTATTCAAAC GGTTTGCTTA AGAGATAAGA AAAAAGTGAC CACTATTAAT 240

TCGAACGCGG CGTAAGCTTA CCTTAATCTC AAGAAGAGCA AAACAAAAGC AACTAATGTA 300

ACGGAATCAT TATCTAGTTA TGATCTGCAA ATAATGTCAC AATACAGCAT GCAAAAAAAT 360

TTTAGATTGC TGCAGATCAG TAGAAGTTTA GCAACGATGG TTCGTGGTAA ACCTATTTCT 420

AAAGAAATCA GAGTATTGAT TAGGGATTAT TTTAAATCTG GAAAGACACT TACGGAGATA 480

AGCAAGCAAT TAAATTTGCC TAAGTCGTCT GTGCATGGGG TGATACAAAT TTTCAAAAAA 540

AATGGGAATA TTGAAAATAA CATTGCGAAT AGAGGCCGAA CATCAGCAAT AACACCCCGC 600

GACAAAAGAC AACTGGCCAA AATTGTTAAG GCTGATCGTC GCCAATCTTT GAGAAATTTG 660

GCTTCTAAGT GGTCGCAGCA ATTGGCAAAA CTGTCAAGCG AGAGTGGACG CGACAAATTA 720

AAAAGTATTG GATATGGTTT TTATAAAGTA TGTTTTGTTA TTACCTGTGC ATCGTACCCA 780

ATAACTTACT CGTAATCTTA CTCGTAGGCC AAGGAAAAAC CCTTGCTTAC GCTTCGTCAA 840

AAAAAGAAGC GTTTGCAATG GGCTCGGGAA AGGATGTCTT GGACTCAAAG GCAATAGGAT 900

ACCATCATAT TCAGCGATGA AGCTAAATTT GATGTTAGTG TCGGCGATAC GAGAAAACGC 960

GTCATCCGTA AGAGGTCAGA AACATACCAT AAAGACTGCC TTAAAAGAAC AACAAAGTTT 1020

CCTGCGAGCA CTATGGTATG GGGATGTATG TCTGCCAAAG GATTAGGAAA ACTTCATTTC 1080

ATTGAAGGGA CAGTTAATGC TGAAAAATAT ATTAATATTT TACAAGATAG TTTGTTGCCA 1140

TCAATACCAA AACTATCAGA TTGCGGTGAA TTCACTTTTC AGCAGGACGG AGCATCATCG 1200

CACACAGCCA AGCGAACCAA AAATTGGCTG CAATATAATC AAATGGAGGT TTTAGATTGG 1260

CCATCAAATA GTCCAGATCT AAGCCCAATT GAAAATATTT GGTGGCTAAT GAAAAACCAG 1320

CTTCGAAATG AGCCACAAAG GAATATTTCT GACTTGAAAA TCAAGTTGCA AGAGATGTGG 1380

GACTCAATTT CTCAAGAGCA TTGCAAAAAT TTGTTAAGCT CAATGCCAAA ACGAGTTAAA 1440

TGCGTAATGC AGGCCAAGGG CGACGTTACA CAATTCTAAT ATTAATTAAA TTATTGTTTT 1500

AAGTATGATA GTAAATCACA TTACGCCGCG TTCGAATTAA TAGTGGTCAC TTTTTTCTTA 1560

TCTCTTAAGC AAACCGTTTG AATAAATTAC TCATATTTTT GTTGTTGTTG GAAATAGAGC 1620

AAAACTTTTT TTTTCGTCGT GAAGAGAATA AAATTCTCTT TGAGACGAAA TGCATTGGTA 1680

TGTGTTATCT TTAGTAGTAT TGATAATATA GTGTGTTAAA CATTGCGCAC TGCAAAAAAA 1740

ACATGCTGTT CGAATTAATA GTGGTTGGGG CTCGT 1775
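
The inverted terminal repeats of the element can be checked computationally: the reverse complement of the left end should equal the right end. The sketch below applies this to the first and last 15 nucleotides of SEQ ID NO:1 only; the function name is an assumption, and extending the check to the full repeat length would require the complete sequence as input.

```python
# Translation table for the four unambiguous DNA bases.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA string containing only A, C, G, T."""
    seq = dna.upper()
    if set(seq) - set("ACGT"):
        raise ValueError("sequence contains characters other than A, C, G, T")
    return seq.translate(_COMPLEMENT)[::-1]


# First and last 15 nucleotides of SEQ ID NO:1, copied from the listing.
left_end = "ACGAGCCCCAACCAC"
right_end = "GTGGTTGGGGCTCGT"

ends_are_inverted_repeats = reverse_complement(left_end) == right_end
```

For these terminal 15-mers the comparison holds, which is what is expected of an element bounded by perfect inverted repeats.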

(2) INFORMATION FOR SEQ ID NO:2:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1775 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:

ACGAGCCCCA ACCACTATTA ATTCGAACAG CATGTTTTTT TTGCAGTGCG CAATGTTTAA 60 

CACACTATAT TATCAATACT ACTAAAGATA ACACATACCA ATGCATTTCG TCTCAAAGAG 120 

AATTTTATTC TCTTCACGAC GAAAAAAAAA GTTTTGCTCT ATTTCCAACA ACAACAAAAA 180 

TATGAGTAAT TTATTCAAAC GGTTTGCTTA AGAGATAAGA AAAAAGTGAC CACTATTAAT 240 

TCGAACGCGG CGTAAGCTTA CCTTAATCTC AAGAAGAGCA AAACAAAAGC AACTAATGTA 300 

ACGGAATCAT TATCTAGTTA TGATCTGCAA ATAATGTCAC AATACAGCAT GCAAAAAAAT 360 

TTTAGATTGC TGCAGATCAG TAGAAGTTTA GCAACGATGG TTCGTGGTAA ACCTATTTCT 420

AAAGAAATCA GAGTATTGAT TAGGGATTAT TTTAAATCTG GAAAGACACT TACGGAGATA 480

AGCAAGCAAT TAAATTTGCC TAAGTCGTCT GTGCATGGGG TGATACAAAT TTTCAAAAAA 540 

AATGGGAATA TTGAAAATAA CATTGCGAAT AGAGGCCGAA CATCAGCAAT AACACCCCGC 600 

GACAAAAGAC AACTGGCCAA AATTGTTAAG GCTGATCGTC GCCAATCTTT GAGAAATTTG 660 

GCTTCTAAGT GGTCGCAGCA ATTGGCAAAA CTGTCAAGCG AGAGTGGACG CGACAAATTA 720 

AAAAGTATTG GATATGGTTT TTATAAAGTA TGTTTTGTTA TTACCTGTGC ATCGTACCCA 780 

ATAACTTACT CGTAATCTTA CTCGTAGGCC AAGGAAAAAC CCTTGCTTAC GCTTCGTCAA 840

AAAAAGAAGC GTTTGCAATG GGCTCGGGAA AGGATGTCTT GGACTCAAAG GCAATGGGAT 900 

ACCATCATAT TCAGCGATGA AGCTAAATTT GATGTTAGTG TCGGCGATAC GAGAAAACGC 960 

GTCATCCGTA AGAGGTCAGA AACATACCAT AAAGACTGCC TTAAAAGAAC AACAAAGTTT 1020

CCTGCGAGCA CTATGGTATG GGGATGTATG TCTGCCAAAG GATTAGGAAA ACTTCATTTC 1080 

ATTGAAGGGA CAGTTAATGC TGAAAAATAT ATTAATATTT TACAAGATAG TTTGTTGCCA 1140 

TCAATACCAA AACTATTAGA TTGCGGTGAA TTCACTTTTC AGCAGGACGG AGCATCATCG 1200 

CACACAGCCA AGCGAACCAA AAATTGGCTG CAATATAATC AAATGGAGGT TTTAGATTGG 1260 

CCATCAAATA GTCCAGATCT AAGCCCAATT GAAAATATTT GGTGGCTAAT GAAAAACCAG 1320 

CTTCGAAATG AGCCACAAAG GAATATTTCT GACTTGAAAA TCAAGTTGCA AGAGATGTGG 1380 

GACTCAATTT CTCAAGAGCA TTGCAAAAAT TTGTTAAGCT CAATGCCAAA ACGAGTTAAA 1440

TGCGTAATGC AGGCCAAGGG CGACGTTACA CAATTCTAAT ATTAATTAAA TTATTGTTTT 1500

AAGTATGATA GTAAATCACA TTACGCCGCG TTCGAATTAA TAGTGGTCAC TTTTTTCTTA 1560

TCTCTTAAGC AAACCGTTTG AATAAATTAC TCATATTTTT GTTGTTGTTG GAAATAGAGC 1620

AAAACTTTTT TTTTCGTCGT GAAGAGAATA AAATTCTCTT TGAGACGAAA TGCATTGGTA 1680

TGTGTTATCT TTAGTAGTAT TGATAATATA GTGTGTTAAA CATTGCGCAC TGCAAAAAAA 1740

ACATGCTGTT CGAATTAATA GTGGTTGGGG CTCGT 1775

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1775 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

ACGAGCCCCA ACCACTATTA ATTCGAACAG CATGTTTTTT TTGCAGTGCG CAATGTTTAA 60

CACACTATAT TATCAATACT ACTAAAGATA ACACATACCA ATGCATTTCG TCTCAAAGAG 120

AATTTTATTC TCTTCACGAC GAAAAAAAAA GTTTTGCTCT ATTTCCAACA ACAACAAAAA 180

TATGAGTAAT TTATTCAAAC GGTTTGCTTA AGAGATAAGA AAAAAGTGAC CACTATTAAT 240

TCGAACGCGG CGTAAGCTTA CCTTAATCTC AAGAAGAGCA AAACAAAAGC AACTAATGTA 300

ACGGAATCAT TATCTAGTTA TGATCTGCAA ATAATGTCAC AATACAGCAT GCAAAAAAAT 360

TTTAGATTGC TGCAGATCAG TAGAAGTTTA GCAACGATGG TTCGTGGTAA ACCTATTTCT 420

AAAGAAATCA GAGTATTGAT TAGGGATTAT TTTAAATCTG GAAAGACACT TACGGAGATA 480

AGCAAGCAAT TAAATTTGCC TAAGTCGTCT GTGCATGGGG TGATACAAAT TTTCAAAAAA 540

AATGGGAATA TTGAAAATAA CATTGCGAAT AGAGGCCGAA CATCAGCAAT AACACCCCGC 600

GACAAAAGAC AACTGGCCAA AATTGTTAAG GCTGATCGTC GCCAATCTTT GAGAAATTTG 660

GCTTCTAAGT GGTCGCAGCA ATTGGCAAAA CTGTCAAGCG AGAGTGGACG CGACAAATTA 720

AAAAGTATTG GATATGGTTT TTATAAAGTA TGTTTTGTTA TTACCTGTGC ATCGTACCCA 780

ATAACTTACT CGTAATCTTA CTCGTAGGCC AAGGAAAAAC CCTTGCTTAC GCTTCGTCAA 840

AAAAAGAAGC GTTTGCAATG GGCTCGGGAA AGGATGTCTT GGACTCAAAG GCAATGGGAT 900

ACCATCATAT TCAGCGATGA AGCTAAATTT GATGTTAGTG TCGGCGATAC GAGAAAACGC 960

GTCATCCGTA AGAGGTCAGA AACATACCAT AAAGACTGCC TTAAAAGAAC AACAAAGTTT 1020

CCTGCGAGCA CTATGGTATG GGGATGTATG TCTGCCAAAG GATTAGGAAA ACTTCATTTC 1080

ATTGAAGGGA CAGTTAATGC TGAAAAATAT ATTAATATTT TACAAGATAG TTTGTTGCCA 1140

TCAATACCAA AACTATCAGA TTGCGGTGAA TTCACTTTTC AGCAGGACGG AGCATCATCG 1200 

CACACAGCCA AGCGAACCAA AAATTGGCTG CAATATAATC AAATGGAGGT TTTAGATTGG 1260 

CCATCAAATA GTCCAGATCT AAGCCCAATT GAAAATATTT GGTGGCTAAT GAAAAACCAG 1320 

CTTCGAAATG AGCCACAAAG GAATATTTCT GACTTGAAAA TCAAGTTGCA AGAGATGTGG 1380 

GACTCAATTT CTCAAGAGCA TTGCAAAAAT TTGTTAAGCT CAATGCCAAA ACGAGTTAAA 1440

TGCGTAATGC AGGCCAAGGG CGACGTTACA CAATTCTAAT ATTAATTAAA TTATTGTTTT 1500 

AAGTATGATA GTAAATCACA TTACGCCGCG TTCGAATTAA TAGTGGTCAC TTTTTTCTTA 1560 

TCTCTTAAGC AAACCGTTTG AATAAATTAC TCATATTTTT GTTGTTGTTG GAAATAGAGC 1620 

AAAACTTTTT TTTTCGTCGT GAAGAGAATA AAATTCTCTT TGAGACGAAA TGCATTGGTA 1680 

TGTGTTATCT TTAGTAGTAT TGATAATATA GTGTGTTAAA CATTGCGCAC TGCAAAAAAA 1740 

ACATGCTGTT CGAATTAATA GTGGTTGGGG CTCGT 1775 
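
As an illustrative check only (it is not part of the original listing), the inverted terminal repeats of the element can be seen directly at the sequence ends: the last 35 bases of SEQ ID NO: 3 are the reverse complement of its first 35 bases. The Python sketch below assumes the two end strings were transcribed correctly from the listing above; the function and constant names are hypothetical.

```python
# Hypothetical helper names; the end sequences are copied from SEQ ID NO: 3 as listed above.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of an upper-case DNA string."""
    return dna.translate(COMPLEMENT)[::-1]


def ends_are_inverted_repeats(left_end: str, right_end: str) -> bool:
    """True if the right end is the reverse complement of the left end."""
    return reverse_complement(right_end) == left_end


# Positions 1-35 and 1741-1775 of SEQ ID NO: 3 (spaces of the listing removed).
LEFT_END = "ACGAGCCCCAACCACTATTAATTCGAACAGCATGT"
RIGHT_END = "ACATGCTGTTCGAATTAATAGTGGTTGGGGCTCGT"

if __name__ == "__main__":
    print(ends_are_inverted_repeats(LEFT_END, RIGHT_END))
```

The same comparison could be extended to longer end segments if the full sequence were loaded as a single string.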



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1779 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: join(398..751, 812..898)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

ACGAGCCCCA ACCACTATTA ATTCGAACAG CATGTTTTTT TTGCAGTGCG CAATGTTTAA 60

CACACTATAT TATCAATACT ACTAAAGATA ACACATACCA ATGCATTTCG TCTCAAAGAG 120

AATTTTATTC TCTTCACGAC GAAAAAAAAA GTTTTGCTCT ATTTCCAACA ACAACAAAAA 180

TATGAGTAAT TTATTCAAAC GGTTTGCTTA AGAGATAAGA AAAAAGTGAC CACTATTAAT 240

TCGAACGCGG CGTAAGCTTA CCTTAATCTC AAGAAGAGCA AAACAAAAGC AACTAATGTA 300

ACGGAATCAT TATCTAGTTA TGATCTGCAA ATAATGTCAC AATACAGCAT GCAAAAAAAT 360

TTTAGAATTG CTGCAGATCA GTAGAAGTTT AGCAACG ATG GTT CGT GGT AAA CCT 415
Met Val Arg Gly Lys Pro
1 5






ATT TCT AAA GAA ATC AGA GTA TTG ATT AGG GAT TAT TTT AAA TCT GGA 463
Ile Ser Lys Glu Ile Arg Val Leu Ile Arg Asp Tyr Phe Lys Ser Gly
10 15 20

AAG ACA CTT ACG GAG ATA AGC AAG CAA TTA AAT TTG CCT AAG TCG TCT 511
Lys Thr Leu Thr Glu Ile Ser Lys Gln Leu Asn Leu Pro Lys Ser Ser
25 30 35

GTG CAT GGG GTG ATA CAA ATT TTC AAA AAA AAT GGG AAT ATT GAA AAT 559
Val His Gly Val Ile Gln Ile Phe Lys Lys Asn Gly Asn Ile Glu Asn
40 45 50

AAC ATT GCG AAT AGA GGC CGA ACA TCA GCA ATA ACA CCC CGC GAC AAA 607
Asn Ile Ala Asn Arg Gly Arg Thr Ser Ala Ile Thr Pro Arg Asp Lys
55 60 65 70

AGA CAA CTG GCC AAA ATT GTT AAG GCT GAT CGT CGC CAA TCT TTG AGA 655
Arg Gln Leu Ala Lys Ile Val Lys Ala Asp Arg Arg Gln Ser Leu Arg
75 80 85

AAT TTG GCT TCT AAG TGG TCG CAG ACA ATT GGC AAA ACT GTC AAG CGA 703
Asn Leu Ala Ser Lys Trp Ser Gln Thr Ile Gly Lys Thr Val Lys Arg
90 95 100

GAG TGG ACG CGA CAG CAA TTA AAA AGT ATT GGA TAT GGT TTT TAT AAA 751
Glu Trp Thr Arg Gln Gln Leu Lys Ser Ile Gly Tyr Gly Phe Tyr Lys
105 110 115

GTATGTTTTG TTATTACCTG TGCATCGTAC CCAATAACTT ACTCGTAATC TTACTCGTAG 811

GCC AAG GAA AAA CCC TTG CTT ACG CTT CGT CAA AAA AAG AAG CGT TTG 859
Ala Lys Glu Lys Pro Leu Leu Thr Leu Arg Gln Lys Lys Lys Arg Leu
120 125 130

CAA TGG GCT CGG GAA AGG ATG TCT TGG ACT CAA AGG CAA TAGGATACCA 908
Gln Trp Ala Arg Glu Arg Met Ser Trp Thr Gln Arg Gln
135 140 145



TCATATTCAG CGATGAAGCT AAATTTGATG TTAGTGTCGG CGATACGAGA AAACGCGTCA 968

TCCGTAAGAG GTCAGAAACA TACCATAAAG ACTGCCTTAA AAGAACAACA AAGTTTCCTG 1028

CGAGCACTAT GGTATGGGGA TGTATGTCTG CCAAAGGATT AGGAAAACTT CATTTCATTG 1088

AAGGGACAGT TAATGCTGAA AAATATATTA ATATTTTACA AGATAGTTTG TTGCCATCAA 1148

TACCAAAACT ATCAGATTGC GGTGAATTCA CTTTTCAGCA GGACGGAGCA TCATCGCACA 1208

CAGCCAAGCG AACCAAAAAT TGGCTGCAAT ATAATCAAAT GGAGGTTTTA GATTGGCCAT 1268

CAAATAGTCC AGATCTAAGC CCAATTGAAA ATATTTGGTG GCTAATGAAA AACCAGCTTC 1328

GAAATGAGCC ACAAAGGAAT ATTTCTGACT TGAAAATCAA GTTGCAAGAG ATGTGGGACT 1388

CAATTTCTCA AGAGCATTGC AAAAATTTGT TAAGCTCAAT GCCAAAACGA GTTAAATGCG 1448

TAATGCAGGC CAAGGGCGAC GTTACACAAT TCTAATATTA ATTAAATTAT TGTTTTAAGT 1508

ATGATAGTAA ATCACATTAC GCCGCGTTCG AATTAATAGT GGTCACTTTT TTCTTATCTC 1568

TTAAGCAAAC CGTTTGAATA AATTACTCAT ATTTTTGTTG TTGTTGGAAA TAGAGCAAAA 1628

CTTTTTTTTT CGTCGTGAAG AGAATAAAAT TCTCTTTGAG ACGAAATGCA TTGGTATGTG 1688

TTATCTTTAG TAGTATTGAT AATATAGTGT GTTAAACATT GCGCACTGCA AAAAAAACAT 1748

GCTGTTCGAA TTAATAGTGG TTGGGGCTCG T 1779

(2) INFORMATION FOR SEQ ID NO:5: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 147 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

Met Val Arg Gly Lys Pro Ile Ser Lys Glu Ile Arg Val Leu Ile Arg
1 5 10 15

Asp Tyr Phe Lys Ser Gly Lys Thr Leu Thr Glu Ile Ser Lys Gln Leu
20 25 30

Asn Leu Pro Lys Ser Ser Val His Gly Val Ile Gln Ile Phe Lys Lys
35 40 45

Asn Gly Asn Ile Glu Asn Asn Ile Ala Asn Arg Gly Arg Thr Ser Ala
50 55 60

Ile Thr Pro Arg Asp Lys Arg Gln Leu Ala Lys Ile Val Lys Ala Asp
65 70 75 80

Arg Arg Gln Ser Leu Arg Asn Leu Ala Ser Lys Trp Ser Gln Thr Ile
85 90 95

Gly Lys Thr Val Lys Arg Glu Trp Thr Arg Gln Gln Leu Lys Ser Ile
100 105 110

Gly Tyr Gly Phe Tyr Lys Ala Lys Glu Lys Pro Leu Leu Thr Leu Arg
115 120 125

Gln Lys Lys Lys Arg Leu Gln Trp Ala Arg Glu Arg Met Ser Trp Thr
130 135 140

Gln Arg Gln
145 
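
The CDS feature of SEQ ID NO: 4 is given as a join of two segments, so the coding sequence is read across the 60 intervening bases at positions 752 to 811. A minimal Python sketch of how such a join location might be spliced and translated follows. It assumes 1-based inclusive coordinates and the standard genetic code, and it reports one-letter amino acid codes rather than the three-letter codes of the listing. The toy construct is assembled from short fragments of the listing, not from the full 1779 bp sequence, and all function names are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def spliced_cds(dna, segments):
    """Concatenate the segments of a join(...) location (1-based, inclusive)."""
    return "".join(dna[start - 1:end] for start, end in segments)


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Toy construct (an assumption for illustration): 4-base leader, the first six codons
# of the CDS, the first 10 bases of the intervening sequence, four codons of the
# second segment, then a stop codon and trailing bases.
TOY = "AACG" + "ATGGTTCGTGGTAAACCT" + "GTATGTTTTG" + "GCCAAGGAAAAA" + "TAGGAT"
TOY_SEGMENTS = [(5, 22), (33, 44)]

if __name__ == "__main__":
    print(translate(spliced_cds(TOY, TOY_SEGMENTS)))
```

For the toy construct the result corresponds to Met Val Arg Gly Lys Pro followed by Ala Lys Glu Lys, matching the residues shown under the corresponding codons in the listing.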



(2) INFORMATION FOR SEQ ID NO: 6: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1779 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: join(398..751, 812..1480)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

ACGAGCCCCA ACCACTATTA ATTCGAACAG CATGTTTTTT TTGCAGTGCG CAATGTTTAA 60 

CACACTATAT TATCAATACT ACTAAAGATA ACACATACCA ATGCATTTCG TCTCAAAGAG 120 

AATTTTATTC TCTTCACGAC GAAAAAAAAA GTTTTGCTCT ATTTCCAACA ACAACAAAAA 180 

TATGAGTAAT TTATTCAAAC GGTTTGCTTA AGAGATAAGA AAAAAGTGAC CACTATTAAT 240

TCGAACGCGG CGTAAGCTTA CCTTAATCTC AAGAAGAGCA AAACAAAAGC AACTAATGTA 300

ACGGAATCAT TATCTAGTTA TGATCTGCAA ATAATGTCAC AATACAGCAT GCAAAAAAAT 360

TTTAGAATTG CTGCAGATCA GTAGAAGTTT AGCAACG ATG GTT CGT GGT AAA CCT 415
Met Val Arg Gly Lys Pro
1 5

ATT TCT AAA GAA ATC AGA GTA TTG ATT AGG GAT TAT TTT AAA TCT GGA 463
Ile Ser Lys Glu Ile Arg Val Leu Ile Arg Asp Tyr Phe Lys Ser Gly
10 15 20

AAG ACA CTT ACG GAG ATA AGC AAG CAA TTA AAT TTG CCT AAG TCG TCT 511
Lys Thr Leu Thr Glu Ile Ser Lys Gln Leu Asn Leu Pro Lys Ser Ser
25 30 35

GTG CAT GGG GTG ATA CAA ATT TTC AAA AAA AAT GGG AAT ATT GAA AAT 559
Val His Gly Val Ile Gln Ile Phe Lys Lys Asn Gly Asn Ile Glu Asn
40 45 50

AAC ATT GCG AAT AGA GGC CGA ACA TCA GCA ATA ACA CCC CGC GAC AAA 607
Asn Ile Ala Asn Arg Gly Arg Thr Ser Ala Ile Thr Pro Arg Asp Lys
55 60 65 70

AGA CAA CTG GCC AAA ATT GTT AAG GCT GAT CGT CGC CAA TCT TTG AGA 655
Arg Gln Leu Ala Lys Ile Val Lys Ala Asp Arg Arg Gln Ser Leu Arg
75 80 85

AAT TTG GCT TCT AAG TGG TCG CAG ACA ATT GGC AAA ACT GTC AAG CGA 703
Asn Leu Ala Ser Lys Trp Ser Gln Thr Ile Gly Lys Thr Val Lys Arg
90 95 100

GAG TGG ACG CGA CAG CAA TTA AAA AGT ATT GGA TAT GGT TTT TAT AAA 751
Glu Trp Thr Arg Gln Gln Leu Lys Ser Ile Gly Tyr Gly Phe Tyr Lys
105 110 115

GTATGTTTTG TTATTACCTG TGCATCGTAC CCAATAACTT ACTCGTAATC TTACTCGTAG 811

GCC AAG GAA AAA CCC TTG CTT ACG CTT CGT CAA AAA AAG AAG CGT TTG 859
Ala Lys Glu Lys Pro Leu Leu Thr Leu Arg Gln Lys Lys Lys Arg Leu
120 125 130

CAA TGG GCT CGG GAA AGG ATG TCT TGG ACT CAA AGG CAA TGG GAT ACC 907
Gln Trp Ala Arg Glu Arg Met Ser Trp Thr Gln Arg Gln Trp Asp Thr
135 140 145 150
ATC ATA TTC AGC GAT GAA GCT AAA TTT GAT GTT AGT GTC GGC GAT ACG 955
Ile Ile Phe Ser Asp Glu Ala Lys Phe Asp Val Ser Val Gly Asp Thr
155 160 165

AGA AAA CGC GTC ATC CGT AAG AGG TCA GAA ACA TAC CAT AAA GAC TGC 1003
Arg Lys Arg Val Ile Arg Lys Arg Ser Glu Thr Tyr His Lys Asp Cys
170 175 180

CTT AAA AGA ACA ACA AAG TTT CCT GCG AGC ACT ATG GTA TGG GGA TGT 1051
Leu Lys Arg Thr Thr Lys Phe Pro Ala Ser Thr Met Val Trp Gly Cys
185 190 195

ATG TCT GCC AAA GGA TTA GGA AAA CTT CAT TTC ATT GAA GGG ACA GTT 1099
Met Ser Ala Lys Gly Leu Gly Lys Leu His Phe Ile Glu Gly Thr Val
200 205 210

AAT GCT GAA AAA TAT ATT AAT ATT TTA CAA GAT AGT TTG TTG CCA TCA 1147
Asn Ala Glu Lys Tyr Ile Asn Ile Leu Gln Asp Ser Leu Leu Pro Ser
215 220 225 230

ATA CCA AAA CTA TTA GAT TGC GGT GAA TTC ACT TTT CAG CAG GAC GGA 1195
Ile Pro Lys Leu Leu Asp Cys Gly Glu Phe Thr Phe Gln Gln Asp Gly
235 240 245

GCA TCA TCG CAC ACA GCC AAG CGA ACC AAA AAT TGG CTG CAA TAT AAT 1243
Ala Ser Ser His Thr Ala Lys Arg Thr Lys Asn Trp Leu Gln Tyr Asn
250 255 260

CAA ATG GAG GTT TTA GAT TGG CCA TCA AAT AGT CCA GAT CTA AGC CCA 1291
Gln Met Glu Val Leu Asp Trp Pro Ser Asn Ser Pro Asp Leu Ser Pro
265 270 275

ATT GAA AAT ATT TGG TGG CTA ATG AAA AAC CAG CTT CGA AAT GAG CCA 1339
Ile Glu Asn Ile Trp Trp Leu Met Lys Asn Gln Leu Arg Asn Glu Pro
280 285 290

CAA AGG AAT ATT TCT GAC TTG AAA ATC AAG TTG CAA GAG ATG TGG GAC 1387
Gln Arg Asn Ile Ser Asp Leu Lys Ile Lys Leu Gln Glu Met Trp Asp
295 300 305 310

TCA ATT TCT CAA GAG CAT TGC AAA AAT TTG TTA AGC TCA ATG CCA AAA 1435
Ser Ile Ser Gln Glu His Cys Lys Asn Leu Leu Ser Ser Met Pro Lys
315 320 325

CGA GTT AAA TGC GTA ATG CAG GCC AAG GGC GAC GTT ACA CAA TTC 1480
Arg Val Lys Cys Val Met Gln Ala Lys Gly Asp Val Thr Gln Phe
330 335 340

TAATATTAAT TAAATTATTG TTTTAAGTAT GATAGTAAAT CACATTACGC CGCGTTCGAA 1540

TTAATAGTGG TCACTTTTTT CTTATCTCTT AAGCAAACCG TTTGAATAAA TTACTCATAT 1600 

TTTTGTTGTT GTTGGAAATA GAGCAAAACT TTTTTTTTCG TCGTGAAGAG AATAAAATTC 1660 

TCTTTGAGAC GAAATGCATT GGTATGTGTT ATCTTTAGTA GTATTGATAA TATAGTGTGT 1720 

TAAACATTGC GCACTGCAAA AAAAACATGC TGTTCGAATT AATAGTGGTT GGGGCTCGT 1779 
(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 341 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Met Val Arg Gly Lys Pro Ile Ser Lys Glu Ile Arg Val Leu Ile Arg
1 5 10 15

Asp Tyr Phe Lys Ser Gly Lys Thr Leu Thr Glu Ile Ser Lys Gln Leu
20 25 30

Asn Leu Pro Lys Ser Ser Val His Gly Val Ile Gln Ile Phe Lys Lys
35 40 45

Asn Gly Asn Ile Glu Asn Asn Ile Ala Asn Arg Gly Arg Thr Ser Ala
50 55 60

Ile Thr Pro Arg Asp Lys Arg Gln Leu Ala Lys Ile Val Lys Ala Asp
65 70 75 80

Arg Arg Gln Ser Leu Arg Asn Leu Ala Ser Lys Trp Ser Gln Thr Ile
85 90 95

Gly Lys Thr Val Lys Arg Glu Trp Thr Arg Gln Gln Leu Lys Ser Ile
100 105 110

Gly Tyr Gly Phe Tyr Lys Ala Lys Glu Lys Pro Leu Leu Thr Leu Arg
115 120 125

Gln Lys Lys Lys Arg Leu Gln Trp Ala Arg Glu Arg Met Ser Trp Thr
130 135 140

Gln Arg Gln Trp Asp Thr Ile Ile Phe Ser Asp Glu Ala Lys Phe Asp
145 150 155 160

Val Ser Val Gly Asp Thr Arg Lys Arg Val Ile Arg Lys Arg Ser Glu
165 170 175

Thr Tyr His Lys Asp Cys Leu Lys Arg Thr Thr Lys Phe Pro Ala Ser
180 185 190

Thr Met Val Trp Gly Cys Met Ser Ala Lys Gly Leu Gly Lys Leu His
195 200 205

Phe Ile Glu Gly Thr Val Asn Ala Glu Lys Tyr Ile Asn Ile Leu Gln
210 215 220

Asp Ser Leu Leu Pro Ser Ile Pro Lys Leu Leu Asp Cys Gly Glu Phe
225 230 235 240

Thr Phe Gln Gln Asp Gly Ala Ser Ser His Thr Ala Lys Arg Thr Lys
245 250 255

Asn Trp Leu Gln Tyr Asn Gln Met Glu Val Leu Asp Trp Pro Ser Asn
260 265 270

Ser Pro Asp Leu Ser Pro Ile Glu Asn Ile Trp Trp Leu Met Lys Asn
275 280 285

Gln Leu Arg Asn Glu Pro Gln Arg Asn Ile Ser Asp Leu Lys Ile Lys
290 295 300

Leu Gln Glu Met Trp Asp Ser Ile Ser Gln Glu His Cys Lys Asn Leu
305 310 315 320

Leu Ser Ser Met Pro Lys Arg Val Lys Cys Val Met Gln Ala Lys Gly
325 330 335

Asp Val Thr Gln Phe
340 
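
The lengths stated for the protein sequences can be cross-checked against the CDS locations of the corresponding nucleotide sequences. As a hedged illustration, the Python sketch below computes the spliced length of a join location and the number of codons it holds, assuming 1-based inclusive coordinates that exclude the stop codon. The function names and the constant names are hypothetical.

```python
def cds_length(segments):
    """Total number of bases covered by a join(...) location (1-based, inclusive)."""
    return sum(end - start + 1 for start, end in segments)


def expected_residues(segments):
    """Number of codons in the spliced CDS; raises if the length is not a multiple of 3."""
    length = cds_length(segments)
    if length % 3 != 0:
        raise ValueError(f"CDS length {length} is not a multiple of 3")
    return length // 3


# CDS locations as given in the FEATURE entries of the listing.
SEQ_ID_4_CDS = [(398, 751), (812, 898)]
SEQ_ID_6_CDS = [(398, 751), (812, 1480)]

if __name__ == "__main__":
    print(expected_residues(SEQ_ID_4_CDS))  # compare with LENGTH of SEQ ID NO: 5
    print(expected_residues(SEQ_ID_6_CDS))  # compare with LENGTH of SEQ ID NO: 7
```

Under these assumptions the first location spans 354 + 87 = 441 bases (147 codons) and the second spans 354 + 669 = 1023 bases (341 codons), in agreement with the lengths given for SEQ ID NO: 5 and SEQ ID NO: 7.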



(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1779 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: join(398..751, 812..1480)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

ACGAGCCCCA ACCACTATTA ATTCGAACAG CATGTTTTTT TTGCAGTGCG CAATGTTTAA 60 

CACACTATAT TATCAATACT ACTAAAGATA ACACATACCA ATGCATTTCG TCTCAAAGAG 120 

AATTTTATTC TCTTCACGAC GAAAAAAAAA GTTTTGCTCT ATTTCCAACA ACAACAAAAA 180 

TATGAGTAAT TTATTCAAAC GGTTTGCTTA AGAGATAAGA AAAAAGTGAC CACTATTAAT 240

TCGAACGCGG CGTAAGCTTA CCTTAATCTC AAGAAGAGCA AAACAAAAGC AACTAATGTA 300

ACGGAATCAT TATCTAGTTA TGATCTGCAA ATAATGTCAC AATACAGCAT GCAAAAAAAT 360

TTTAGAATTG CTGCAGATCA GTAGAAGTTT AGCAACG ATG GTT CGT GGT AAA CCT 415
Met Val Arg Gly Lys Pro
1 5

ATT TCT AAA GAA ATC AGA GTA TTG ATT AGG GAT TAT TTT AAA TCT GGA 463
Ile Ser Lys Glu Ile Arg Val Leu Ile Arg Asp Tyr Phe Lys Ser Gly
10 15 20

AAG ACA CTT ACG GAG ATA AGC AAG CAA TTA AAT TTG CCT AAG TCG TCT 511
Lys Thr Leu Thr Glu Ile Ser Lys Gln Leu Asn Leu Pro Lys Ser Ser
25 30 35

GTG CAT GGG GTG ATA CAA ATT TTC AAA AAA AAT GGG AAT ATT GAA AAT 559
Val His Gly Val Ile Gln Ile Phe Lys Lys Asn Gly Asn Ile Glu Asn
40 45 50
AAC ATT GCG AAT AGA GGC CGA ACA TCA GCA ATA ACA CCC CGC GAC AAA 607
Asn Ile Ala Asn Arg Gly Arg Thr Ser Ala Ile Thr Pro Arg Asp Lys
55 60 65 70

AGA CAA CTG GCC AAA ATT GTT AAG GCT GAT CGT CGC CAA TCT TTG AGA 655
Arg Gln Leu Ala Lys Ile Val Lys Ala Asp Arg Arg Gln Ser Leu Arg
75 80 85

AAT TTG GCT TCT AAG TGG TCG CAG ACA ATT GGC AAA ACT GTC AAG CGA 703
Asn Leu Ala Ser Lys Trp Ser Gln Thr Ile Gly Lys Thr Val Lys Arg
90 95 100

GAG TGG ACG CGA CAG CAA TTA AAA AGT ATT GGA TAT GGT TTT TAT AAA 751
Glu Trp Thr Arg Gln Gln Leu Lys Ser Ile Gly Tyr Gly Phe Tyr Lys
105 110 115

GTATGTTTTG TTATTACCTG TGCATCGTAC CCAATAACTT ACTCGTAATC TTACTCGTAG 811

GCC AAG GAA AAA CCC TTG CTT ACG CTT CGT CAA AAA AAG AAG CGT TTG 859
Ala Lys Glu Lys Pro Leu Leu Thr Leu Arg Gln Lys Lys Lys Arg Leu
120 125 130

CAA TGG GCT CGG GAA AGG ATG TCT TGG ACT CAA AGG CAA TGG GAT ACC 907
Gln Trp Ala Arg Glu Arg Met Ser Trp Thr Gln Arg Gln Trp Asp Thr
135 140 145 150

ATC ATA TTC AGC GAT GAA GCT AAA TTT GAT GTT AGT GTC GGC GAT ACG 955
Ile Ile Phe Ser Asp Glu Ala Lys Phe Asp Val Ser Val Gly Asp Thr
155 160 165

AGA AAA CGC GTC ATC CGT AAG AGG TCA GAA ACA TAC CAT AAA GAC TGC 1003
Arg Lys Arg Val Ile Arg Lys Arg Ser Glu Thr Tyr His Lys Asp Cys
170 175 180

CTT AAA AGA ACA ACA AAG TTT CCT GCG AGC ACT ATG GTA TGG GGA TGT 1051
Leu Lys Arg Thr Thr Lys Phe Pro Ala Ser Thr Met Val Trp Gly Cys
185 190 195

ATG TCT GCC AAA GGA TTA GGA AAA CTT CAT TTC ATT GAA GGG ACA GTT 1099
Met Ser Ala Lys Gly Leu Gly Lys Leu His Phe Ile Glu Gly Thr Val
200 205 210

AAT GCT GAA AAA TAT ATT AAT ATT TTA CAA GAT AGT TTG TTG CCA TCA 1147
Asn Ala Glu Lys Tyr Ile Asn Ile Leu Gln Asp Ser Leu Leu Pro Ser
215 220 225 230

ATA CCA AAA CTA TCA GAT TGC GGT GAA TTC ACT TTT CAG CAG GAC GGA 1195
Ile Pro Lys Leu Ser Asp Cys Gly Glu Phe Thr Phe Gln Gln Asp Gly
235 240 245

GCA TCA TCG CAC ACA GCC AAG CGA ACC AAA AAT TGG CTG CAA TAT AAT 1243
Ala Ser Ser His Thr Ala Lys Arg Thr Lys Asn Trp Leu Gln Tyr Asn
250 255 260

CAA ATG GAG GTT TTA GAT TGG CCA TCA AAT AGT CCA GAT CTA AGC CCA 1291
Gln Met Glu Val Leu Asp Trp Pro Ser Asn Ser Pro Asp Leu Ser Pro
265 270 275

ATT GAA AAT ATT TGG TGG CTA ATG AAA AAC CAG CTT CGA AAT GAG CCA 1339
Ile Glu Asn Ile Trp Trp Leu Met Lys Asn Gln Leu Arg Asn Glu Pro
280 285 290






CAA AGG AAT ATT TCT GAC TTG AAA ATC AAG TTG CAA GAG ATG TGG GAC 1387
Gln Arg Asn Ile Ser Asp Leu Lys Ile Lys Leu Gln Glu Met Trp Asp
295 300 305 310

TCA ATT TCT CAA GAG CAT TGC AAA AAT TTG TTA AGC TCA ATG CCA AAA 1435
Ser Ile Ser Gln Glu His Cys Lys Asn Leu Leu Ser Ser Met Pro Lys
315 320 325

CGA GTT AAA TGC GTA ATG CAG GCC AAG GGC GAC GTT ACA CAA TTC 1480
Arg Val Lys Cys Val Met Gln Ala Lys Gly Asp Val Thr Gln Phe
330 335 340






TAATATTAAT TAAATTATTG TTTTAAGTAT GATAGTAAAT CACATTACGC CGCGTTCGAA 1540

TTAATAGTGG TCACTTTTTT CTTATCTCTT AAGCAAACCG TTTGAATAAA TTACTCATAT 1600

TTTTGTTGTT GTTGGAAATA GAGCAAAACT TTTTTTTTCG TCGTGAAGAG AATAAAATTC 1660

TCTTTGAGAC GAAATGCATT GGTATGTGTT ATCTTTAGTA GTATTGATAA TATAGTGTGT 1720

TAAACATTGC GCACTGCAAA AAAAACATGC TGTTCGAATT AATAGTGGTT GGGGCTCGT 1779



(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 341 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Met Val Arg Gly Lys Pro Ile Ser Lys Glu Ile Arg Val Leu Ile Arg
1 5 10 15

Asp Tyr Phe Lys Ser Gly Lys Thr Leu Thr Glu Ile Ser Lys Gln Leu
20 25 30

Asn Leu Pro Lys Ser Ser Val His Gly Val Ile Gln Ile Phe Lys Lys
35 40 45

Asn Gly Asn Ile Glu Asn Asn Ile Ala Asn Arg Gly Arg Thr Ser Ala
50 55 60

Ile Thr Pro Arg Asp Lys Arg Gln Leu Ala Lys Ile Val Lys Ala Asp
65 70 75 80

Arg Arg Gln Ser Leu Arg Asn Leu Ala Ser Lys Trp Ser Gln Thr Ile
85 90 95

Gly Lys Thr Val Lys Arg Glu Trp Thr Arg Gln Gln Leu Lys Ser Ile
100 105 110

Gly Tyr Gly Phe Tyr Lys Ala Lys Glu Lys Pro Leu Leu Thr Leu Arg
115 120 125

Gln Lys Lys Lys Arg Leu Gln Trp Ala Arg Glu Arg Met Ser Trp Thr
130 135 140

Gln Arg Gln Trp Asp Thr Ile Ile Phe Ser Asp Glu Ala Lys Phe Asp
145 150 155 160

Val Ser Val Gly Asp Thr Arg Lys Arg Val Ile Arg Lys Arg Ser Glu
165 170 175

Thr Tyr His Lys Asp Cys Leu Lys Arg Thr Thr Lys Phe Pro Ala Ser
180 185 190

Thr Met Val Trp Gly Cys Met Ser Ala Lys Gly Leu Gly Lys Leu His
195 200 205

Phe Ile Glu Gly Thr Val Asn Ala Glu Lys Tyr Ile Asn Ile Leu Gln
210 215 220

Asp Ser Leu Leu Pro Ser Ile Pro Lys Leu Ser Asp Cys Gly Glu Phe
225 230 235 240

Thr Phe Gln Gln Asp Gly Ala Ser Ser His Thr Ala Lys Arg Thr Lys
245 250 255

Asn Trp Leu Gln Tyr Asn Gln Met Glu Val Leu Asp Trp Pro Ser Asn
260 265 270

Ser Pro Asp Leu Ser Pro Ile Glu Asn Ile Trp Trp Leu Met Lys Asn
275 280 285

Gln Leu Arg Asn Glu Pro Gln Arg Asn Ile Ser Asp Leu Lys Ile Lys
290 295 300

Leu Gln Glu Met Trp Asp Ser Ile Ser Gln Glu His Cys Lys Asn Leu
305 310 315 320

Leu Ser Ser Met Pro Lys Arg Val Lys Cys Val Met Gln Ala Lys Gly
325 330 335

Asp Val Thr Gln Phe
340

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Val Trp Gly Cys 
1 5 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

Trp Pro Ser Gln Ser Pro Asp Leu
1 5

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 8 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

Trp Pro Ser Asn Ser Pro Asp Leu
1 5


Claims 

1. A method for inducing a mutation in a cell, comprising the steps of:

a) providing an isolated transposable element having a nucleic acid sequence which hybridizes to the DNA
sequence of SEQ ID NO:1 or SEQ ID NO:4; and

b) introducing the isolated transposable element of step a) into the cell in the presence of:

i) a transposase protein encoded by a nucleic acid sequence which hybridizes to the DNA sequence of
SEQ ID NO:1 or SEQ ID NO:4; or

ii) a nucleic acid sequence encoding a transposase protein, the nucleic acid sequence characterized by
the ability to hybridize to the DNA sequence of SEQ ID NO:1 or SEQ ID NO:4, optionally wherein the
transposable element is modified to include a promoter operably linked to an indicator gene under the control
of said promoter flanked by the inverted terminal repeats of the isolated transposable element and/or the
transposable element and nucleic acid sequence encoding the transposase protein are incorporated into
a viral vector.

2. A method for isolating a gene of interest in a cell which includes a mutation, comprising the steps of:

a) providing an isolated transposable element having a nucleic acid sequence which hybridizes to the DNA
sequence of SEQ ID NO:1 or SEQ ID NO:4, the isolated transposable element being modified to include a
promoter operably linked to an indicator gene under the control of said promoter flanked by the inverted terminal
repeats of the isolated transposable element;

b) introducing the isolated transposable element of step a) into a population of cells in the presence of:

i) a transposase protein encoded by a nucleic acid sequence which hybridizes to the DNA sequence of
SEQ ID NO:1 or SEQ ID NO:4; or

ii) a nucleic acid sequence encoding a transposase protein, the nucleic acid sequence characterized by
the ability to hybridize to the DNA sequence of SEQ ID NO:1 or SEQ ID NO:4, thereby producing a sample;

c) detecting expression of the indicator gene in the sample obtained in step b), thereby identifying cells in which
the transposable element has integrated into the genome of the cells;

d) selecting from among the cells identified in step c) cells which have a mutation in a gene of interest; and

e) isolating the gene of interest which includes the mutation from the cells identified in step d), optionally: (a)
wherein the indicator gene is selected from the group consisting of:

a) a selectable marker gene; and

b) a reporter gene, and/or (b) wherein the transposable element and nucleic acid sequence encoding the
transposase protein are incorporated into a viral vector; and/or (c) wherein the cell is an animal somatic or
germ line cell.

3. A method for selecting an insertional mutation in a gene in a cell, comprising the steps of:

a) providing an isolated transposable element having a nucleic acid sequence which hybridizes to the DNA
sequence of SEQ ID NO:1 or SEQ ID NO:4, the isolated transposable element being modified to include a
minimal promoter operably linked to an indicator gene flanked by the inverted terminal repeats of the isolated
transposable element;

b) introducing the isolated transposable element of step a) into a population of cells in the presence of:

i) a transposase protein encoded by a nucleic acid sequence which hybridizes to the DNA sequence of
SEQ ID NO:1 or SEQ ID NO:4; or

ii) a nucleic acid sequence encoding a transposase protein, the nucleic acid sequence characterized by
the ability to hybridize to the DNA sequence of SEQ ID NO:1 or SEQ ID NO:4, thereby producing a sample;

c) detecting expression of the indicator gene in the sample obtained in step b), thereby identifying cells in which
the transposable element has integrated near or within a gene; and

d) isolating from the cells identified in step c) the gene in which the transposable element has integrated near
or within, optionally: (a) wherein the minimal promoter is a TATA box; and/or (b) wherein the transposable
element and nucleic acid sequence encoding the transposase protein are incorporated into a viral vector; and/or
(c) wherein the cell is an animal somatic or germ line cell.


4. A method for selecting an insertional mutation in a gene in a cell, comprising the steps of:

a) providing an isolated transposable element having a nucleic acid sequence which hybridizes to the DNA
sequence of SEQ ID NO:1 or SEQ ID NO:4, the isolated transposable element being modified to include a
splice acceptor site operably linked to an indicator gene flanked by the inverted terminal repeats of the isolated
transposable element;

b) introducing the isolated transposable element of step a) into a population of cells in the presence of:

i) a transposase protein encoded by a nucleic acid sequence which hybridizes to the DNA sequence of
SEQ ID NO:1 or SEQ ID NO:4; or

ii) a nucleic acid sequence encoding a transposase protein, the nucleic acid sequence characterized by
the ability to hybridize to the DNA sequence of SEQ ID NO:1 or SEQ ID NO:4, thereby producing a sample;

c) detecting expression of the indicator gene in the sample obtained in step b), thereby identifying cells in which
the transposable element has integrated near or within a gene; and

d) isolating from the cells identified in step c) the gene in which the transposable element has integrated near
or within, optionally: (a) wherein the transposable element and nucleic acid sequence encoding the
transposase protein are incorporated into a viral vector, wherein the cell is an animal somatic or germ line cell.


5. A method for reversing a mutation in a gene of interest, obtained according to the method of Claim 2, comprising
introducing into cells identified in step d):

i) a transposase protein encoded by a nucleic acid sequence which hybridizes to the DNA sequence of SEQ
ID NO:1 or SEQ ID NO:4; or

ii) a nucleic acid sequence encoding a transposase protein, the nucleic acid sequence characterized by the
ability to hybridize to the DNA sequence of SEQ ID NO:1 or SEQ ID NO:4, optionally: (a) wherein the nucleic
acid sequence encoding the transposase protein is incorporated into a viral vector, and/or (b) wherein the cells
are animal somatic or germ line cells.

6. A method for reversing a mutation in a gene, obtained according to the method of Claim 3, comprising introducing
into cells identified in step c):

i) a transposase protein encoded by a nucleic acid sequence which hybridizes to the DNA sequence of SEQ
ID NO:1 or SEQ ID NO:4; or

ii) a nucleic acid sequence encoding a transposase protein, the nucleic acid sequence characterized by the
ability to hybridize to the DNA sequence of SEQ ID NO:1 or SEQ ID NO:4, optionally: (a) wherein the nucleic
acid sequence encoding the transposase protein is incorporated into a viral vector; and/or (b) wherein the cells
are animal somatic or germ line cells.

7. A method for reversing a mutation in a gene, obtained according to the method of Claim 4, comprising introducing
into cells identified in step c):

i) a transposase protein encoded by a nucleic acid sequence which hybridizes to the DNA sequence of SEQ
ID NO:1 or SEQ ID NO:4; or

ii) a nucleic acid sequence encoding a transposase protein, the nucleic acid sequence characterized by the
ability to hybridize to the DNA sequence of SEQ ID NO:1 or SEQ ID NO:4; optionally: (a) wherein the nucleic
acid sequence encoding the transposase protein is incorporated into a viral vector; and/or (b) wherein the cells
are animal somatic or germ line cells.

8. A method for introducing a reversible mutation in a gene of interest in a cell, comprising the steps of:

a) providing an isolated transposable element having a nucleic acid sequence which hybridizes to the DNA
sequence of SEQ ID NO:1 or SEQ ID NO:4, the isolated transposable element being modified to include:

i) a promoter operably linked to an indicator gene flanked by the inverted terminal repeats of the isolated
transposable element; or

ii) a minimal promoter operably linked to an indicator gene flanked by the inverted terminal repeats of the
isolated transposable element; or

iii) a splice acceptor site operably linked to an indicator gene flanked by the inverted terminal repeats of
the isolated transposable element;

b) introducing the isolated transposable element of step a) into a gene of interest, thereby producing a mutated
gene;

c) introducing the mutated gene of step b) into a population of cells under conditions sufficient for homologous
recombination between the mutated gene and the corresponding endogenous gene, thereby producing a sample; and

d) selecting from the sample obtained in step c) cells in which the endogenous gene has been replaced by the
mutated gene, optionally: (a) wherein the indicator gene is selected from the group consisting of:

a) a reporter gene; and

b) a selectable marker gene; and/or (b) wherein the minimal promoter is a TATA box; and/or (c) wherein
the transposable element is incorporated into a viral vector; and/or (d) wherein the cell is an animal
somatic or germ line cell.

9. A method for reversing a mutation in a gene, obtained according to the method of Claim 8, comprising introducing
into cells identified in step d):

i) a transposase protein encoded by a nucleic acid sequence which hybridizes to the DNA sequence of SEQ
ID NO:1 or SEQ ID NO:4; or

ii) a nucleic acid sequence encoding a transposase protein, the nucleic acid sequence characterized by the
ability to hybridize to the DNA sequence of SEQ ID NO:1 or SEQ ID NO:4, optionally: (a) wherein the nucleic
acid sequence encoding the transposase protein is incorporated into a viral vector; and/or (b) wherein the cells
are animal somatic or germ line cells.

10. A method for inducing loss of a nucleic acid sequence of interest which was integrated into the chromosome of a
cell according to a method comprising the steps of:

a) providing an isolated transposable element having a nucleic acid sequence which hybridizes to the DNA
sequence of SEQ ID NO:1 or SEQ ID NO:4, the isolated transposable element being modified to include the
nucleic acid sequence of interest flanked by the inverted terminal repeats of the isolated transposable element;
and

b) introducing the isolated transposable element of step a) into the cell in the presence of:

i) a transposase protein encoded by a nucleic acid sequence which hybridizes to the DNA sequence of
SEQ ID NO:1 or SEQ ID NO:4; or

ii) a nucleic acid sequence encoding a transposase protein, the nucleic acid sequence characterized by
the ability to hybridize to the DNA sequence of SEQ ID NO:1 or SEQ ID NO:4, thereby producing a cell
comprising the nucleic acid sequence of interest integrated into its chromosome,

wherein said method comprises introducing into the cell comprising the nucleic acid sequence of interest
integrated into its chromosome:

1) a transposase protein encoded by a nucleic acid sequence which hybridizes to the DNA sequence of
SEQ ID NO:1 or SEQ ID NO:4; or

2) a nucleic acid sequence encoding a transposase protein, the nucleic acid sequence characterized by
the ability to hybridize to the DNA sequence of SEQ ID NO:1 or SEQ ID NO:4, optionally: (a) wherein the
nucleic acid sequence encoding the transposase protein is incorporated into a viral vector; and/or (b)
wherein the cells are animal somatic or germ line cells; and/or (c) wherein the cells are somatic or germ line
cells of a transgenic animal; and/or (d) wherein the isolated transposable element is modified to include the gene
of interest operably linked to a promoter and an indicator gene under the control of said promoter.

11. A method of producing a transgenic plant, comprising the steps of:

a) providing an isolated transposable element having a nucleic acid sequence which hybridizes to the DNA
sequence of SEQ ID NO:1 or SEQ ID NO:4, the isolated transposable element being modified to include the
nucleic acid sequence of interest flanked by the inverted terminal repeats of the isolated transposable element;

b) introducing the isolated transposable element of step a) into a plant cell in the presence of:

i) a transposase protein encoded by a nucleic acid sequence which hybridizes to the DNA sequence of
SEQ ID NO:1 or SEQ ID NO:4; or

ii) a nucleic acid sequence encoding a transposase protein, the nucleic acid sequence characterized by
the ability to hybridize to the DNA sequence of SEQ ID NO:1 or SEQ ID NO:4; and

c) cultivating the transformed plant cell obtained in step b) under conditions appropriate for regeneration of a
plant, thereby producing the transgenic plant, optionally: (a) wherein the nucleic acid sequence of interest
encodes a protein of interest; and/or (b) wherein the isolated transposable element is modified to include the
nucleic acid sequence of interest operably linked to a promoter and an indicator gene under the control of said
promoter; and/or (c) wherein the nucleic acid sequence of interest is selected from the group consisting of:

a) a reporter gene; and

b) a selectable marker gene; and/or (d) wherein the transposable element and the nucleic acid sequence
encoding the transposase protein are incorporated into a viral vector.

12. A method of producing a transgenic animal and progeny thereof, comprising the steps of:

a) providing an isolated transposable element having a nucleic acid sequence which hybridizes to the DNA
sequence of SEQ ID NO:1 or SEQ ID NO:4, the isolated transposable element being modified to include the
nucleic acid sequence of interest flanked by the inverted terminal repeats of the isolated transposable element;
and

b) introducing the isolated transposable element of step a) into a germ line cell of an animal in the presence of:

i) a transposase protein encoded by a nucleic acid sequence which hybridizes to the DNA sequence of
SEQ ID NO:1 or SEQ ID NO:4; or

ii) a nucleic acid sequence encoding a transposase protein, the nucleic acid sequence characterized by
the ability to hybridize to the DNA sequence of SEQ ID NO:1 or SEQ ID NO:4, optionally: (a) wherein the
nucleic acid sequence of interest encodes a protein of interest; and/or (b) wherein the isolated transposable
element is modified to include the nucleic acid sequence of interest operably linked to a promoter and an
indicator gene under the control of said promoter; and/or (c) wherein the nucleic acid sequence of interest
is selected from the group consisting of:

a) a reporter gene; and

b) a selectable marker gene; and/or (d) wherein the transposable element and the nucleic acid
sequence encoding the transposase protein are incorporated into a viral vector.

13. A method for integrating a nucleic acid sequence of interest into the chromosome of a cell, comprising the steps of:

a) providing an isolated transposable element having a nucleic acid sequence which hybridizes to the DNA
sequence of SEQ ID NO:1 or SEQ ID NO:4, the isolated transposable element being modified to include the
nucleic acid sequence of interest flanked by the inverted terminal repeats of the isolated transposable element;
and

b) introducing the isolated transposable element of step a) into the cell in the presence of:

i) a transposase protein encoded by a nucleic acid sequence which hybridizes to the DNA sequence of
SEQ ID NO:1 or SEQ ID NO:4; or

ii) a nucleic acid sequence encoding a transposase protein, the nucleic acid sequence characterized by
the ability to hybridize to the DNA sequence of SEQ ID NO:1 or SEQ ID NO:4, thereby producing a sample;
optionally: (a) further comprising selecting from the sample obtained in step b) cells in which the
transposable element has integrated into the chromosome; and/or (b) wherein the nucleic acid sequence
encoding the transposase protein is integrated into the genome of the cell prior to the transposable element
containing the nucleic acid sequence of interest; and/or (c) wherein the cell is an animal somatic or
germ line cell; and/or (d) wherein the nucleic acid sequence of interest encodes a protein of interest;
and/or (e) wherein the isolated transposable element is modified to include the nucleic acid sequence of
interest operably linked to a promoter and an indicator gene under the control of said promoter (wherein
for example the indicator gene is selected from the group consisting of:

a) a reporter gene; and

b) a selectable marker gene); and/or (f) wherein the nucleic acid sequence of interest is selected from
the group consisting of:

a) a reporter gene; and

b) a selectable marker gene; and/or (g) wherein the transposable element and the nucleic acid
sequence encoding the transposase protein are incorporated into a viral vector.
